INDEX
Explanations
words related to birds
words related to birds and their characteristics
Negative Logits
ptive      -0.77
quit       -0.76
icago      -0.72
perature   -0.70
pering     -0.69
ptives     -0.68
course     -0.68
uble       -0.68
auga       -0.67
hemor      -0.67
Positive Logits
ird        0.89
ressing    0.86
orf        0.82
rie        0.80
odge       0.78
ness       0.78
ress       0.78
ed         0.75
rop        0.75
rill       0.74
Activations Density 0.010%