Explanations
statements asserting the existence or condition of something
Negative Logits
surla          -0.54
SharedCtor     -0.54
للاسماء        -0.51
AnchorStyles   -0.50
vestream       -0.49
ỡng            -0.49
للمعارف        -0.48
éez            -0.48
مشين           -0.45
мәкал          -0.44
Positive Logits
true           2.39
True           2.20
true           2.08
True           2.05
TRUE           1.77
TRUE           1.59
vrai           1.48
truth          1.36
verdade        1.31
правда         1.24
Activations Density 0.078%
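
For readers unfamiliar with the density figure, the sketch below shows one common way such a number is obtained: the fraction of sampled token positions on which the feature's activation is above zero. This is an assumption about how the dashboard defines density, not something the page states. The function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical and chosen only to reproduce a value of 0.078%.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumed definition (not confirmed by the dashboard): a position counts
    as active when its activation is strictly greater than the threshold.
    """
    if not activations:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 100,000 token positions, 78 of which are active.
sample = [1.5] * 78 + [0.0] * (100_000 - 78)
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.078% means the feature is active on roughly 8 of every 10,000 sampled tokens, which is sparse and fits a feature tied to a narrow set of tokens such as "true" and its variants.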