INDEX
Explanations
mentions of specific names or terms associated with characters or entities
Negative Logits
senal         -0.86
Blueprint     -0.71
aeda          -0.69
deviations    -0.68
Reloaded      -0.64
Transparency  -0.64
inately       -0.64
body          -0.61
nces          -0.60
principals    -0.60
Positive Logits
ipe           1.05
ipeg          1.05
liness        0.93
borough       0.90
Mandela       0.88
ifest         0.83
emon          0.83
lette         0.83
ijah          0.82
ners          0.82
Activations Density: 0.006%