INDEX
Explanations
conjunctions and references to online experiences or services
New Auto-Interp
Negative Logits
-0.76
-0.71
↵↵
-0.65
(
-0.63
1
-0.57
↵
-0.56
the
-0.55
.
-0.54
'
-0.54
?
-0.53
POSITIVE LOGITS
itſelf
1.14
Мексичка
1.13
Efq
1.11
Houſe
1.07
pleaſure
1.07
ſever
1.06
Monfieur
1.03
Reſ
1.03
useRouter
1.01
་་
1.00
Activations Density 0.471%