INDEX
Explanations
instances of the character 'v' or variants of it in different contexts
New Auto-Interp
Negative Logits
raiſ
-1.12
ſche
-0.97
myſelf
-0.97
ſelf
-0.95
Majefty
-0.91
purpoſe
-0.91
ſelves
-0.90
iſt
-0.89
Theſe
-0.89
་་
-0.88
POSITIVE LOGITS
v
2.15
v
1.82
vv
1.09
vv
0.98
v
0.97
Fv
0.94
w
0.91
vlog
0.88
u
0.85
vB
0.82
Activations Density 0.142%