INDEX
Explanations
instances of the word 'first' occurring in sentences
occurrences of the word "first."
New Auto-Interp
Negative Logits
tics
-0.76
mbuds
-0.74
acons
-0.71
Progress
-0.70
borg
-0.69
pers
-0.68
athed
-0.66
leaders
-0.66
apped
-0.64
STEM
-0.63
POSITIVE LOGITS
thing
1.02
iteration
0.95
few
0.93
batch
0.93
couple
0.92
installment
0.91
baseman
0.91
step
0.90
layer
0.90
glance
0.90
Activations Density 0.128%