INDEX
Explanations
references to addresses and numerical data
numbers and punctuation
New Auto-Interp
Negative Logits
selfies
-0.60
autorytatywna
-0.59
'\\;'
-0.58
incentiv
-0.57
للمعارف
-0.53
selfie
-0.53
utafitiHapana
-0.53
expandindo
-0.52
StructEnd
-0.52
Cyfeiriadau
-0.51
POSITIVE LOGITS
faßt
0.62
Einfluß
0.52
aDecoder
0.49
Bewußt
0.48
muß
0.43
dentaire
0.42
daß
0.41
Saddam
0.41
vskip
0.41
BrowserModule
0.39
Activations Density 0.010%