INDEX
Explanations
specific emotional terms or expressions related to family, deception, choices, and significant actions or events
various scripts and languages
Negative Logits
fjspx            -0.95
WithIOException  -0.70
@"/              -0.70
RegistryLite     -0.69
دانشنامهٔ         -0.69
basicConfig      -0.67
CppMethod        -0.66
للاسماء          -0.65
snippetHide      -0.64
DoubleQuotes     -0.63
POSITIVE LOGITS
خ      1.78
خ      1.45
الخ    1.31
الخ    0.89
وخ     0.88
والخ   0.86
ख      0.77
X      0.70
تخ     0.67
يخ     0.66
Activations Density: 0.001%