INDEX
Explanations
phrases related to transformation or change
variations of the word "become."
New Auto-Interp
Negative Logits
mble
-0.74
rug
-0.65
uphill
-0.65
tightly
-0.62
commons
-0.61
DER
-0.60
park
-0.59
rh
-0.59
rainy
-0.59
leftover
-0.58
POSITIVE LOGITS
bec
1.10
leans
1.03
oming
1.02
zek
0.99
isons
0.95
racuse
0.94
clair
0.94
uity
0.91
erie
0.90
Bec
0.85
Activations Density 0.004%