INDEX
Explanations
phrases that indicate setting or configuration instructions
Negative Logits
AndEndTag    -0.93
ostavi       -0.88
myſelf       -0.87
يتيمه        -0.85
الحره        -0.82
Jefus        -0.82
perſon       -0.81
houſe        -0.80
itſelf       -0.79
Anſ          -0.78
Positive Logits
(blank)      0.67
nonatomic    0.63
(blank)      0.53
чив          0.52
<            0.51
zies         0.50
a            0.48
írás         0.48
indi         0.47
hav          0.47
Activations Density 0.046%