INDEX
Explanations
the word "amb" with varying levels of activation
instances of the token "amb" in various contexts
New Auto-Interp
Negative Logits
naires
-0.77
backer
-0.74
kies
-0.71
creen
-0.71
Downloadha
-0.70
terday
-0.68
housing
-0.68
ãĥīãĥ©
-0.66
spection
-0.66
suspicion
-0.66
POSITIVE LOGITS
ilib
1.04
assador
1.02
odies
0.90
assadors
0.90
ilit
0.87
rill
0.87
alo
0.86
uilding
0.84
olic
0.83
ulative
0.82
Activations Density 0.019%