Explanations
organizations or individuals with specific names
commas and punctuation indicating lists or separations in text
Negative Logits
Token      Logit
irie       -0.73
sburg      -0.72
odes       -0.69
rast       -0.65
izen       -0.65
othe       -0.64
osexual    -0.63
ood        -0.61
sein       -0.60
uces       -0.60
Positive Logits
Token      Logit
which      1.21
whose      1.20
aka        1.06
whose      1.05
which      0.96
although   0.94
who        0.93
whereas    0.87
namely     0.84
Magikarp   0.83
Activations Density: 0.408%