INDEX
Explanations
the beginning of the text
Follows certain punctuation or special characters
Russian phenotypic features
Negative Logits
Token   Logit
,       -0.82
.       -0.79
-       -0.78
:       -0.70
–       -0.69
B       -0.65
(       -0.64
        -0.60
a       -0.57
@       -0.56
Positive Logits
Token            Logit
myſelf           1.47
itſelf           1.44
purpoſe          1.40
pleaſure         1.39
houſe            1.36
تقاوى            1.36
GenerationType   1.34
Anſ              1.30
تضيفلها          1.29
Houſe            1.29
Activations Density 0.122%