INDEX
Explanations
instances of the word "bit" within various contexts
references to incremental or gradual changes
New Auto-Interp
Negative Logits
ammad
-0.76
etheus
-0.71
pend
-0.67
theless
-0.64
velt
-0.64
Pend
-0.64
Vaj
-0.64
gaard
-0.63
anguage
-0.63
Duty
-0.62
POSITIVE LOGITS
terness
1.25
umen
1.13
ches
1.12
buck
1.02
ching
1.00
wig
0.99
umin
0.97
chery
0.97
ters
0.91
meal
0.91
Activations Density 0.026%