INDEX
Explanations
plural nouns
mentions of multiple entities or items
New Auto-Interp
Negative Logits
ahime
-0.66
Activities
-0.64
SPONSORED
-0.63
Leth
-0.63
Fre
-0.62
rum
-0.62
Anthrop
-0.62
Boxing
-0.61
Newsletter
-0.60
humane
-0.60
POSITIVE LOGITS
apiece
1.21
totaling
1.01
poons
0.93
paces
0.90
hips
0.84
thirds
0.79
simultaneously
0.77
ixt
0.75
consecut
0.75
agos
0.74
Activations Density 0.255%