INDEX
Explanations
words related to emotional reactions like fascination, excitement, and anger
emotional reactions and responses related to engagement and intrigue
New Auto-Interp
Negative Logits
©¶æ
-0.81
porary
-0.69
itled
-0.66
forearm
-0.65
uts
-0.65
glas
-0.63
slips
-0.63
nect
-0.62
uben
-0.62
abs
-0.61
POSITIVE LOGITS
ingly
1.05
onlook
0.95
audiences
0.92
us
0.89
me
0.85
him
0.78
passers
0.78
viewers
0.77
readers
0.74
sensibilities
0.74
Activations Density 0.182%