INDEX
Explanations
references to the word "it" in various contexts
Negative Logits
Majefty     -0.95
himſelf     -0.87
raiſ        -0.86
Monfieur    -0.86
Houſe       -0.85
myſelf      -0.84
houſe       -0.84
poffe       -0.84
ſtate       -0.81
whoſe       -0.79
Positive Logits
It          0.68
wasn        0.67
It          0.60
it          0.60
Wasn        0.57
wouldn      0.56
quoi        0.56
The         0.54
ça          0.54
playSound   0.54
Activations Density 0.153%