INDEX
Explanations
the presence of the letter 'v' in various forms and contexts
New Auto-Interp
Negative Logits
myſelf
-0.90
ſelf
-0.88
raiſ
-0.88
Majefty
-0.82
ſelves
-0.82
elebr
-0.79
fevere
-0.78
ſche
-0.78
purpoſe
-0.77
neceffary
-0.73
POSITIVE LOGITS
v
1.72
v
1.67
vv
1.00
Fv
0.96
vv
0.91
Vv
0.88
v
0.88
脚注の使い方
0.86
rv
0.83
tv
0.82
Activations Density 0.119%