INDEX
Explanations
special characters and formatting cues in the text
New Auto-Interp
Negative Logits
Shakspeare
-0.63
vixion
-0.61
ovp
-0.59
piaggio
-0.57
}:${-0.57
itſelf
-0.56
yaris
-0.56
Voit
-0.56
xodo
-0.55
domestique
-0.55
POSITIVE LOGITS
<bos>
1.36
ècie
1.01
findpost
0.89
arşivlendi
0.87
GraphicsUnit
0.85
الحره
0.84
تانيه
0.82
tvguidetime
0.81
HomeAsUpEnabled
0.80
незавершена
0.78
Activations Density 0.265%