INDEX
Explanations
phrases indicating abundance or availability
New Auto-Interp
Negative Logits
ียà¸ļ
-0.14
jem
-0.14
rej
-0.14
IVEN
-0.13
Dut
-0.13
536
-0.13
Unblock
-0.13
лив
-0.13
asthan
-0.13
isoft
-0.13
POSITIVE LOGITS
ways
0.19
yyy
0.15
autres
0.15
-bodied
0.15
fold
0.14
other
0.14
others
0.14
ertime
0.14
489
0.13
äll
0.13
Activations Density 0.033%