INDEX
Explanations
personal reflections or emotional responses in text
expressions of personal feelings and emotions
New Auto-Interp
Negative Logits
tesy
-0.64
srfAttach
-0.61
Pigs
-0.60
Course
-0.60
stice
-0.60
Globe
-0.59
Miko
-0.59
ibrary
-0.58
Door
-0.58
Via
-0.58
POSITIVE LOGITS
realise
1.18
realize
1.17
feel
1.07
rethink
1.01
aware
1.01
hesitate
1.01
reconsider
1.01
cringe
1.00
uneasy
0.96
accountable
0.96
Activations Density 0.071%