INDEX
Explanations
occurrences of the word "few."
New Auto-Interp
Negative Logits
acus
-0.86
roit
-0.77
hes
-0.74
yrinth
-0.73
abal
-0.72
webkit
-0.72
UAL
-0.71
sein
-0.69
alam
-0.68
said
-0.68
POSITIVE LOGITS
dozen
1.36
hundred
1.25
thousand
1.07
weeks
1.00
dozen
0.95
months
0.90
days
0.89
nights
0.88
minutes
0.86
exceptions
0.84
Activations Density 0.048%