INDEX
Explanations
phrases that indicate a general quality or property of something
Negative Logits
Fox      -0.15
جر       -0.15
ạn       -0.15
als      -0.15
iro      -0.15
ittel    -0.14
leck     -0.14
rtc      -0.14
ystick   -0.14
impan    -0.14
Positive Logits
941      0.16
yanı     0.15
enberg   0.14
celik    0.14
DU       0.14
лива     0.14
SKTOP    0.14
ombat    0.14
ëĬIJ      0.14
δÏģο     0.13
Activations Density: 0.122%