Explanations
queries or questions starting with "Who"
the word "Who" as a recurring element, indicating inquiries about individuals or entities
Negative Logits
MER         -0.86
PORT        -0.68
Hyde        -0.68
rog         -0.63
Rye         -0.62
York        -0.61
BACK        -0.60
personal    -0.58
Et          -0.57
OOL         -0.57
Positive Logits
soever      1.16
ever        1.02
oping       1.01
abouts      0.96
else        0.93
oped        0.89
cares       0.86
knows       0.85
osh         0.82
zbollah     0.81
Activations Density: 0.085%