INDEX
Explanations
names of people and places
references to a specific brand or product related to beer
Negative Logits
Token      Logit
ALLY       -0.73
LESS       -0.68
GOODMAN    -0.68
acion      -0.67
ulators    -0.66
ORE        -0.66
eering     -0.64
amide      -0.63
ARC        -0.60
CAR        -0.60
Positive Logits
Token      Logit
ught       1.17
fters      1.15
cffff      1.04
enei       0.97
dra        0.94
plets      0.92
ven        0.91
isine      0.87
uth        0.85
fter       0.84
Activations Density: 0.012%