INDEX
Explanations
words related to measurements or quantities
terms related to specific food or drink products
New Auto-Interp
Negative Logits
�
-0.64
..........
-0.63
-0.62
âĢİ
-0.59
�
-0.56
��
-0.55
``
-0.54
âĸł
-0.51
âĢº
-0.50
-0.49
POSITIVE LOGITS
Annotations
0.56
pload
0.54
guiActiveUn
0.53
erker
0.51
uci
0.51
pedia
0.51
ultimate
0.51
gyn
0.50
ghazi
0.50
Hera
0.50
Activations Density 3.515%