INDEX
Explanations
quantities or measurements mentioned in the text
words related to measurements and quantities of consumable items
New Auto-Interp
Negative Logits
architecture
-0.76
incumb
-0.68
invari
-0.68
avid
-0.64
Ruins
-0.63
arche
-0.62
stru
-0.62
canonical
-0.62
pioneer
-0.61
backward
-0.60
POSITIVE LOGITS
bottles
1.41
bottle
1.28
cans
1.14
cigarettes
1.01
whiskey
0.99
vodka
0.97
liquor
0.96
cig
0.95
batches
0.93
alcohol
0.91
Activations Density 0.253%