INDEX
Explanations
references to wine and wine-related experiences
New Auto-Interp
Negative Logits
IOD
-0.17
aca
-0.16
rum
-0.15
Leh
-0.14
/native
-0.14
ç·ł
-0.14
rado
-0.14
umn
-0.13
rum
-0.13
#
-0.13
POSITIVE LOGITS
èĻ«
0.15
orsch
0.14
ichel
0.14
ái
0.14
raz
0.14
ullen
0.13
CEF
0.13
ä¾į
0.13
okud
0.13
schle
0.13
Activations Density 0.231%