INDEX
Explanations
references to specific research labs or institutions
lab or laboratory
New Auto-Interp
Negative Logits
Italijanski
-0.53
beginnetje
-0.47
ကိုးကား
-0.47
čenje
-0.46
ligiloj
-0.46
linguri
-0.46
فريبيس
-0.45
Controllo
-0.45
Източници
-0.44
Савезне
-0.44
POSITIVE LOGITS
lab
3.19
Lab
2.33
LAB
2.25
lab
2.11
Lab
2.05
LAB
1.93
labs
1.74
labs
1.47
Labs
1.45
laboratory
1.36
Activations Density 0.008%