INDEX
Explanations
sharp objects or instances described as sharp
references to sharp objects or sharpness in general
New Auto-Interp
Negative Logits
mits
-0.79
pty
-0.79
ople
-0.74
rella
-0.72
ORED
-0.70
ccess
-0.70
BLE
-0.69
mitting
-0.69
uthor
-0.66
avis
-0.66
POSITIVE LOGITS
ened
1.07
sharp
1.03
ening
0.95
sharper
0.87
ness
0.85
sharp
0.82
blade
0.81
distinction
0.80
ener
0.76
spikes
0.76
Activations Density 0.009%