INDEX
Explanations
verbs in the present participle form ending in "-ing."
New Auto-Interp
Negative Logits
velength
-0.77
isable
-0.68
esthetic
-0.65
asonable
-0.63
WAYS
-0.62
Rouge
-0.62
ndra
-0.61
fx
-0.61
RED
-0.61
llular
-0.60
POSITIVE LOGITS
buster
0.98
neck
0.94
bust
0.89
enegger
0.89
lar
0.85
s
0.84
ing
0.81
les
0.81
sch
0.81
busters
0.80
Activations Density 0.019%