INDEX
Explanations
references to geographic locations and proper nouns
Followed by "ider", "rough", "olu", "izi", or "inawa"
New Auto-Interp
Negative Logits
,
-0.59
-0.58
(
-0.53
the
-0.51
,
-0.51
next
-0.50
:].
-0.50
for
-0.50
with
-0.49
<eos>
-0.49
POSITIVE LOGITS
Majefty
0.91
Efq
0.85
مرئيه
0.76
doubtnut
0.76
OGND
0.76
TextAppearance
0.75
purpoſe
0.74
Partagez
0.72
τογραφ
0.70
ſelves
0.70
Activations Density 0.283%