INDEX
Explanations
proper nouns referring to people named Karen
instances of the name "Karen."
New Auto-Interp
Negative Logits
etheus
-0.81
udic
-0.73
othal
-0.70
cision
-0.70
rid
-0.69
aneous
-0.68
derogatory
-0.67
Logged
-0.66
igated
-0.66
NSA
-0.65
POSITIVE LOGITS
Karen
1.19
Sue
0.93
lapt
0.80
ratom
0.77
Borders
0.75
Kuro
0.75
Silk
0.70
itably
0.70
ites
0.69
taining
0.69
Activations Density 0.007%