INDEX
Explanations
phrases that emphasize the quantity or significance of an entity, particularly in terms of totals or counts
New Auto-Interp
Negative Logits
pring
-0.63
Presence
-0.61
condem
-0.60
nesses
-0.56
era
-0.56
trave
-0.55
tera
-0.54
pod
-0.54
ness
-0.53
adays
-0.53
POSITIVE LOGITS
eight
1.05
seven
1.00
six
1.00
nine
0.97
thirteen
0.97
THREE
0.93
fourteen
0.93
nineteen
0.91
four
0.89
seventy
0.89
Activations Density 0.020%