INDEX
Explanations
mentions of milk or words related to milk
references to milk or milk-related products
New Auto-Interp
Negative Logits
Unch
-0.71
Warriors
-0.69
ORTS
-0.68
Sins
-0.66
CARD
-0.66
Result
-0.65
BOOK
-0.64
Fear
-0.64
Errors
-0.64
OPLE
-0.63
POSITIVE LOGITS
mil
1.16
isec
0.98
ieu
0.98
waukee
0.94
mil
0.93
ksh
0.92
fed
0.85
vin
0.79
enium
0.78
quet
0.77
Activations Density 0.004%