INDEX
Explanations
elements related to formal structures and organizational contexts
New Auto-Interp
Negative Logits
الرياضيه
-0.82
“
-0.81
"
-0.70
becauſe
-0.64
、「
-0.64
,“
-0.60
,「
-0.60
respectively
-0.57
виправивши
-0.57
-0.57
POSITIVE LOGITS
」
0.90
”-
0.87
”
0.85
"}
0.84
"-
0.80
"
0.78
”—
0.78
"?
0.78
”?
0.78
'}
0.77
Activations Density 0.297%