INDEX
Explanations
phrases that indicate possession or association
New Auto-Interp
Negative Logits
cela
-0.16
Bent
-0.16
unks
-0.15
obo
-0.14
nyder
-0.14
occan
-0.14
assi
-0.14
/native
-0.14
asaki
-0.13
abaj
-0.13
POSITIVE LOGITS
venues
0.15
oice
0.15
importance
0.14
ibu
0.14
Shields
0.14
.mass
0.14
venue
0.13
concern
0.13
elia
0.13
bile
0.13
Activations Density 0.298%