INDEX
Explanations
the presence of indicators of competitive interests and funding
start of phrase or sentence
Negative Logits
estekak          -0.63
مشين             -0.58
الرياضيه         -0.57
<<<<<<<<<<<<<<   -0.53
stdc             -0.51
jsxFileName      -0.50
OOTDTY           -0.49
المناصب          -0.49
aternion         -0.48
ViewInit         -0.47
Positive Logits
the       0.56
The       0.40
صفحۀ      0.36
daß       0.35
The       0.34
their     0.33
dostęp    0.33
从        0.32
))        0.32
}')       0.31
Activations Density 0.013%