INDEX
Explanations
phrases and terms indicating repetition or emphasis on new ideas or information
New Auto-Interp
Negative Logits
autres
-0.99
demais
-0.90
demás
-0.83
other
-0.81
other
-0.80
others
-0.78
others
-0.78
lainnya
-0.71
còn
-0.70
autres
-0.70
POSITIVE LOGITS
couple
0.74
dozen
0.73
}}"></
0.70
layer
0.70
important
0.70
huge
0.69
interesting
0.68
paio
0.66
worldly
0.66
few
0.65
Activations Density 0.114%