Explanations
phrases related to quantity or extent
phrases that include "of."
Negative Logits
ÙIJ       -0.83
Ô         -0.70
chwitz    -0.68
¯¯        -0.68
resy      -0.67
Policy    -0.66
IER       -0.63
wark      -0.63
iland     -0.62
ertodd    -0.61
Positive Logits
possibilities     0.76
digits            0.71
acron             0.69
bullets           0.67
contradictions    0.66
asses             0.64
ties              0.64
elements          0.61
beasts            0.60
totality          0.60
Activations Density 0.179%