INDEX
Explanations
proper nouns or people's names
terms associated with organizations and political groups
Negative Logits
ournals     -0.68
nesday      -0.58
camel       -0.54
idan        -0.50
Marriott    -0.50
showc       -0.49
Aram        -0.49
ffect       -0.48
idious      -0.47
cour        -0.47
Positive Logits
ibrary      0.62
overe       0.60
abulary     0.57
otta        0.55
QL          0.55
otide       0.53
ovych       0.52
oxide       0.52
oshenko     0.51
ombat       0.50
Activations Density: 1.318%