INDEX
Explanations
non-English characters, potentially indicating it is looking for specific languages or unique character patterns
character sequences or special characters that suggest a specific encoding or language
New Auto-Interp
Negative Logits
perse
-0.75
poaching
-0.70
livest
-0.70
scarce
-0.69
dexter
-0.68
comfort
-0.65
nutrit
-0.64
dipping
-0.64
traff
-0.64
Wan
-0.64
POSITIVE LOGITS
ÑĮ
1.31
ÑĢ
1.24
о
1.22
а
1.20
в
1.12
е
1.12
и
1.11
Ñģ
1.10
ments
1.08
оÐ
1.07
Activations Density 0.009%