INDEX
Explanations
phrases related to criticism or negative judgment
unique punctuation marks and their frequencies in sentences
New Auto-Interp
Negative Logits
imum
-0.88
PTS
-0.81
Flavoring
-0.78
GEAR
-0.77
ERO
-0.76
inction
-0.75
ESA
-0.75
terday
-0.75
SHIP
-0.74
availability
-0.74
POSITIVE LOGITS
charismatic
1.01
bearded
1.00
thirsty
0.97
rebellious
0.95
adventurous
0.93
hungry
0.92
sweaty
0.92
haired
0.91
aggressive
0.91
educated
0.91
Activations Density 0.154%