Explanations
phrases indicating quantity or numerical references
Negative Logits
Token        Logit
Cosponsors   -0.71
Fenrir       -0.68
vir          -0.63
âĺ           -0.61
aque         -0.60
Tanz         -0.59
sensit       -0.59
helle        -0.58
mare         -0.57
pots         -0.57
Positive Logits
Token        Logit
aneers       0.67
nine         0.67
eight        0.66
seven        0.63
six          0.61
four         0.61
three        0.60
attempted    0.60
consecutive  0.59
agements     0.59
Activations Density: 0.039%