INDEX
Explanations
the word "basing" in various contexts
New Auto-Interp
Negative Logits
Oaks
-0.67
Toll
-0.56
Rollins
-0.56
WIND
-0.56
hood
-0.55
conscience
-0.55
theless
-0.55
WORK
-0.54
Goods
-0.54
flair
-0.54
POSITIVE LOGITS
sembly
1.04
ements
1.00
emen
0.98
cule
0.95
estation
0.94
idi
0.89
ility
0.88
uit
0.86
imet
0.85
ename
0.84
Activations Density 0.015%