Explanations
bird names, specifically focusing on types of birds
references to birds, particularly doves and eagles
Negative Logits
Token      Logit
Gas        -0.84
olute      -0.75
rogen      -0.67
Sing       -0.65
coerc      -0.64
isters     -0.64
infeld     -0.63
LV         -0.63
enter      -0.63
akespe     -0.63
Positive Logits
Token      Logit
eagle      1.14
dove       1.01
pigeon     0.94
tail       0.92
owl        0.92
hawk       0.86
trout      0.82
Owl        0.82
vironment  0.81
hawk       0.81
Activations Density: 0.020%