INDEX
Explanations
questions starting with the word "Who"
instances of the word "who."
New Auto-Interp
Negative Logits
emin
-0.64
PORT
-0.64
GV
-0.62
immersion
-0.62
Globe
-0.61
saturation
-0.61
MER
-0.61
compatibility
-0.58
rocket
-0.58
Pilgrim
-0.57
POSITIVE LOGITS
soever
1.18
ever
1.16
cares
1.15
else
1.15
oping
1.09
knows
1.04
ops
0.99
oped
0.92
osh
0.89
opsy
0.84
Activations Density 0.039%