INDEX
Explanations
mentions of notable individuals and organizations in various contexts
Negative Logits
glob       -0.17
genuine    -0.16
Gest       -0.15
gesture    -0.15
Std        -0.15
گرد        -0.15
gallon     -0.14
gesture    -0.14
gallery    -0.14
SH         -0.14
Positive Logits
(G         0.31
GG         0.29
SG         0.29
GG         0.28
OG         0.26
FG         0.24
SG         0.24
GN         0.23
GP         0.23
MG         0.23
Activations Density 0.205%