INDEX
Explanations
expressions emphasizing agreement or understanding
the word "totally" and its variations used for emphasis
New Auto-Interp
Negative Logits
pring
-0.79
rers
-0.79
llor
-0.78
liest
-0.76
åº
-0.75
ulative
-0.74
mere
-0.74
lest
-0.70
ourses
-0.70
ently
-0.68
POSITIVE LOGITS
obliter
0.73
unrelated
0.72
heartedly
0.68
STAR
0.67
annihil
0.67
ect
0.64
allo
0.64
ove
0.63
und
0.63
ogen
0.63
Activations Density 0.012%