Explanations
mentions of typing activities
Negative Logits
Territories    -0.68
=~=~           -0.64
Gamble         -0.59
endors         -0.59
ADS            -0.58
ordial         -0.58
mos            -0.57
Supplement     -0.57
plement        -0.56
Bey            -0.56
Positive Logits
ahead          1.09
aloud          1.08
face           0.87
faces          0.85
codes          0.84
mitted         0.72
ilde           0.71
typing         0.70
code           0.70
oshop          0.70
Activations Density: 0.025%