Explanations
phrases expressing personal reactions or experiences
expressions of personal appreciation and satisfaction
Negative Logits
farious        -0.69
violations     -0.66
guiName        -0.63
breaches       -0.63
Occupations    -0.62
objections     -0.61
etsk           -0.61
provocation    -0.61
ãĤ¼            -0.60
impunity       -0.59
Positive Logits
adore          1.17
congratulate   1.09
'm             1.09
hope           1.08
love           1.07
LOVE           1.07
loved          1.05
enjoyed        1.01
salute         1.01
appreciate     0.99
Activations Density: 0.188%