INDEX
Explanations
specific identifiers or codes related to various entities or categories
New Auto-Interp
Negative Logits
Bibliograf
-0.49
Ӏ
-0.47
Barbier
-0.47
Weid
-0.47
ThemeOverlay
-0.47
urati
-0.45
poved
-0.45
बै
-0.44
partic
-0.44
barrera
-0.43
POSITIVE LOGITS
PL
0.96
PLA
0.95
Pla
0.93
Pl
0.91
pla
0.91
PLAT
0.90
Plat
0.90
pla
0.89
PL
0.88
pl
0.88
Activations Density 0.233%