INDEX
Explanations
occurrences of the verb "are"
New Auto-Interp
Negative Logits
itſelf
-1.63
Jefus
-1.59
Theſe
-1.58
Majefty
-1.51
Monfieur
-1.50
Efq
-1.47
myſelf
-1.42
himſelf
-1.42
themſelves
-1.40
$_"
-1.39
POSITIVE LOGITS
am
1.26
Am
0.79
am
0.76
AM
0.75
im
0.73
Am
0.68
I
0.66
AM
0.64
0.61
N
0.60
Activations Density 0.236%