INDEX
Explanations
references to tea or afternoon tea-related events
New Auto-Interp
Negative Logits
elman
-0.16
istrovstvÃŃ
-0.16
ilst
-0.16
warz
-0.14
berapa
-0.14
isman
-0.14
aad
-0.14
bih
-0.14
óż
-0.14
odafone
-0.14
POSITIVE LOGITS
bubbles
0.46
bubb
0.43
Spark
0.35
sparkling
0.34
Spark
0.34
fizz
0.33
bubble
0.33
Champagne
0.33
Bubble
0.33
champ
0.32
Activations Density 0.043%