INDEX
Explanations
informal language involving places or organizations, like nicknames or abbreviations
places and organizations
Negative Logits
spring      -0.80
ERN         -0.76
ELL         -0.72
andise      -0.71
orage       -0.69
ebted       -0.69
ifies       -0.69
eers        -0.68
izations    -0.68
ifying      -0.67
Positive Logits
volent      0.82
vious       0.80
judicial    0.77
gress       0.76
phrine      0.76
AFTA        0.72
llor        0.69
mand        0.69
pless       0.69
boro        0.67
Activations Density: 0.027%