INDEX
Explanations
prepositions followed by specific locations or contexts
references to political events or situations
New Auto-Interp
Negative Logits
eret
-0.73
Reviewed
-0.68
imar
-0.67
bsite
-0.66
íķ
-0.64
IAS
-0.63
Fab
-0.62
closet
-0.62
derog
-0.61
perspect
-0.61
POSITIVE LOGITS
marks
0.80
keeping
0.75
score
0.67
sburgh
0.66
tery
0.63
EMP
0.59
alion
0.58
okane
0.58
Corm
0.57
mark
0.57
Activations Density 0.000%