INDEX
Explanations
phrases related to expressing opinions or emotions strongly
expressions of indifference or disregard
Negative Logits
Token     Logit
Ct        -0.80
iba       -0.69
ById      -0.68
eness     -0.65
ODY       -0.64
yrinth    -0.63
ourge     -0.61
Jah       -0.61
orial     -0.59
xtap      -0.59
Positive Logits
Token         Logit
thumbs        1.18
impression    1.16
keynote       0.92
nod           0.90
speeches      0.87
shout         0.83
hint          0.83
indication    0.79
cred          0.76
rundown       0.75
Activations Density: 0.131%
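
For reference, a density figure like the one above is usually read as the fraction of sampled tokens on which the feature fires. The sketch below illustrates that reading only. It assumes "fires" means an activation strictly greater than zero, and both the function name `activation_density` and the sample data are hypothetical, not taken from this page.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: a feature counts as active on a token when its
    activation is strictly greater than `threshold` (0.0 by default).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical sample: the feature fires on 131 of 100,000 tokens.
sample = [1.0] * 131 + [0.0] * (100_000 - 131)
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.131% corresponds to roughly 1 active token in every 760 sampled.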