INDEX
Explanations
references to locations and geographical entities
New Auto-Interp
Negative Logits
كومونز
-0.57
httphttps
-0.52
Rhestr
-0.51
يتيمه
-0.50
afficheront
-0.47
tagHelperRunner
-0.47
ब्रेकडाउन
-0.45
للاسماء
-0.45
IsContent
-0.44
بيها
-0.44
POSITIVE LOGITS
<eos>
1.00
✭✭
0.45
endregion
0.43
depart
0.42
orde
0.42
enfans
0.41
concludes
0.41
https
0.41
Kars
0.40
Италијани
0.40
Activations Density 0.302%