INDEX
Explanations
mentions of the name "Ale" followed by a number indicating the strength of activation
references to "Ale" beer types or brands
New Auto-Interp
Negative Logits
enegger
-0.91
ãĤ¼ãĤ¦ãĤ¹
-0.87
ä¹ĭ
-0.85
ãģį
-0.74
Everyday
-0.69
ãģı
-0.68
aneously
-0.65
CFR
-0.64
ledged
-0.64
ãĥķãĤ©
-0.64
POSITIVE LOGITS
jandro
1.24
ppo
0.98
Ale
0.94
lette
0.94
oga
0.90
rique
0.87
terness
0.86
ogan
0.83
thia
0.82
vin
0.81
Activations Density 0.006%