INDEX
Explanations
phrases indicating a higher degree or comparison
phrases indicating likelihood or correlation
New Auto-Interp
Negative Logits
xton
-0.65
ILCS
-0.65
avin
-0.62
mberg
-0.61
;;;;;;;;;;;;
-0.61
Flag
-0.60
motion
-0.59
iasis
-0.59
Assass
-0.58
omen
-0.58
POSITIVE LOGITS
than
1.75
than
1.66
Than
1.16
erous
0.74
efficient
0.63
harsher
0.63
wiser
0.61
udeb
0.61
manageable
0.60
relative
0.58
Activations Density 0.756%