INDEX
Explanations
references to a certain place or person named "Iota"
occurrences of the substring "ota"
Negative Logits
Token         Logit
directions    -0.77
igree         -0.73
satire        -0.71
takedown      -0.71
groom         -0.70
stroke        -0.70
behavi        -0.68
aciously      -0.67
writers       -0.67
carbohyd      -0.67
Positive Logits
Token           Logit
ota             1.53
OTA             1.02
BILITY          0.92
iba             0.82
Luxem           0.79
Pharmaceutical  0.79
oya             0.77
ÄŁ              0.76
rolet           0.76
uta             0.75
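
As a rough sanity check on the second explanation (occurrences of the substring "ota"), the sketch below tests which of the positive-logit tokens listed above contain that substring. The values are copied from this page. The helper name `matches_explanation` and the case-insensitive comparison are assumptions made for illustration, and this is not how the explanations themselves were generated or scored.

```python
# Top positive-logit tokens and their values, copied from the listing above.
positive_logits = [
    ("ota", 1.53),
    ("OTA", 1.02),
    ("BILITY", 0.92),
    ("iba", 0.82),
    ("Luxem", 0.79),
    ("Pharmaceutical", 0.79),
    ("oya", 0.77),
    ("ÄŁ", 0.76),
    ("rolet", 0.76),
    ("uta", 0.75),
]


def matches_explanation(token: str, needle: str = "ota") -> bool:
    """Hypothetical helper: case-insensitive substring test.

    Folding case is an assumption; the explanation does not say
    whether "OTA" should count as a match.
    """
    return needle in token.lower()


# Keep only the tokens that literally contain the substring.
matching = [tok for tok, _ in positive_logits if matches_explanation(tok)]
print(matching)
```

Under that assumption only the two highest-valued tokens match literally, while near-misses such as "oya" and "uta" do not, which suggests the substring explanation describes the strongest promoted tokens rather than the whole list.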
Activations Density: 0.005%