INDEX
Explanations
mentions of the name "Angela" and related terms, particularly in contexts that evoke strong emotional responses or themes of anger
New Auto-Interp
Negative Logits
ksen
-0.18
podob
-0.16
etat
-0.15
ouser
-0.15
æ½
-0.15
etto
-0.15
atives
-0.15
ahun
-0.15
acles
-0.15
etten
-0.15
POSITIVE LOGITS
uish
0.26
ered
0.24
lic
0.23
strom
0.21
kor
0.21
gota
0.21
olan
0.20
iolet
0.20
leton
0.20
ry
0.20
Activations Density 0.008%