INDEX
Explanations
quotes or statements made by individuals
New Auto-Interp
Negative Logits
earances
-0.77
aphael
-0.75
hoff
-0.74
ption
-0.71
theless
-0.66
ammy
-0.65
hern
-0.65
————
-0.64
cific
-0.64
ohyd
-0.64
POSITIVE LOGITS
deems
0.76
XM
0.73
©¶æ¥µ
0.72
ij士
0.70
deem
0.70
IONS
0.68
é¾
0.67
¾
0.64
represents
0.64
ought
0.64
Activations Density 6.727%