INDEX
Explanations
verbs related to aggressive physical actions
labels or descriptors related to consumption or physical actions involving objects
New Auto-Interp
Negative Logits
CRIP
-0.78
alam
-0.70
oft
-0.69
ashtra
-0.67
WP
-0.64
Initialized
-0.63
omething
-0.61
arium
-0.60
naire
-0.60
Edison
-0.60
POSITIVE LOGITS
bing
2.23
bed
1.88
bling
1.87
bled
1.82
bles
1.74
ber
1.70
bers
1.70
blers
1.66
bler
1.63
ble
1.61
Activations Density 0.129%