INDEX
Explanations
elements related to addresses and affiliations, such as departments, universities, and locations
Negative Logits
?    -0.46
tab    -0.43
<eos>    -0.39
متعلقه    -0.38
offerta    -0.37
사    -0.36
componentWill    -0.36
findall    -0.36
...    -0.36
率    -0.35
Positive Logits
الرياضيه    1.02
Anſ    0.90
Perſ    0.87
Diſ    0.84
Reſ    0.83
Inſ    0.82
PLWABN    0.82
ſind    0.81
تضيفلها    0.80
awtextra    0.79
Activations Density 0.346%