Explanations
expressions related to social interactions and conflicts
comma usage in sentences with contrasting ideas or clauses
Negative Logits
tains        -0.88
itational    -0.74
ドラ         -0.68
nergy        -0.67
qqa          -0.66
ワ           -0.65
tesy         -0.63
cellence     -0.63
luence       -0.62
eatures      -0.62
Positive Logits
fearing      1.31
afraid       1.14
worried      1.03
preferring   1.01
ashamed      1.00
believing    0.99
realizing    0.98
wondering    0.98
feared       0.97
thinking     0.97
Activations Density: 0.396%