INDEX
Explanations
references to alcoholic beverages and their production processes
New Auto-Interp
Negative Logits
IDE
-0.16
626
-0.15
elo
-0.15
checks
-0.15
nets
-0.15
/INFO
-0.14
ylland
-0.14
accepted
-0.14
ozor
-0.14
321
-0.14
POSITIVE LOGITS
clid
0.15
DataTask
0.14
ihn
0.14
è±Ĭ
0.14
اÙĦسÙħ
0.14
vak
0.14
ản
0.14
assis
0.14
нок
0.14
ãĥ³ãĥĸ
0.13
Activations Density 0.026%