INDEX
Explanations
punctuation and formatting cues related to data presentation
text following numbers
New Auto-Interp
Negative Logits
OGND
-0.52
RTSC
-0.51
Gön
-0.51
aapt
-0.44
Chham
-0.42
raisals
-0.40
DrawerToggle
-0.40
Diweddarwch
-0.39
apter
-0.39
alse
-0.39
POSITIVE LOGITS
SequentialGroup
0.48
Wikiseite
0.46
Sziasztok
0.43
imprimée
0.40
➌
0.39
ioterapia
0.38
fjor
0.38
0.38
Notae
0.38
biały
0.38
Activations Density 0.025%