INDEX
Explanations
questions starting with "Who" that are related to various topics
references to inquiries or questions about identity
New Auto-Interp
Negative Logits
MER
-0.94
PORT
-0.70
BACK
-0.67
interstitial
-0.66
England
-0.64
Hyde
-0.63
OOL
-0.61
outer
-0.61
Glob
-0.60
Rog
-0.60
POSITIVE LOGITS
soever
1.16
ever
0.97
oping
0.95
else
0.91
abouts
0.87
zbollah
0.86
cares
0.85
knows
0.83
oped
0.82
ileaks
0.80
Activations Density 0.120%