Explanations
the word "vel" with varying levels of activation
words or phrases that indicate high value or excellence
Negative Logits
SHARE       -0.67
AME         -0.66
BILITIES    -0.64
EMS         -0.63
nces        -0.63
Ended       -0.62
apple       -0.60
SAM         -0.60
Noon        -0.60
MIS         -0.60
Positive Logits
ocity       1.28
vel         1.06
icit        0.88
vet         0.82
ieve        0.81
iev         0.81
icular      0.80
theless     0.78
istically   0.78
edo         0.77
Activations Density: 0.006%