INDEX
Explanations
instances of the word "many" indicating emphasis on quantity or prevalence
Negative Logits
gate      -0.15
atters    -0.14
ibbon     -0.14
à¸Ļà¸ķ    -0.14
lete      -0.14
apı       -0.14
åĸ        -0.14
inders    -0.13
phies     -0.13
ddf       -0.13
Positive Logits
867       0.15
oday      0.14
-times    0.14
(++       0.14
-lined    0.14
Gel       0.14
Volk      0.14
ardy      0.14
ough      0.14
powering  0.14
Activations Density: 0.081%