INDEX
Explanations
references to bananas and their characteristics
New Auto-Interp
Negative Logits
emm
-0.16
æ¡IJ
-0.15
rase
-0.15
ãĥ¼ãĥį
-0.15
_traits
-0.15
çĭIJ
-0.15
æĻ¶
-0.15
hare
-0.15
ecer
-0.14
illac
-0.14
POSITIVE LOGITS
Banana
0.24
banana
0.23
ripe
0.22
banana
0.22
bananas
0.21
èķī
0.20
peel
0.20
ripe
0.19
Ban
0.19
Ban
0.18
Activations Density 0.030%