INDEX
Explanations
references to bananas and their states of ripeness
New Auto-Interp
Negative Logits
ilm
-0.17
emm
-0.15
-tank
-0.15
.tencent
-0.14
Canter
-0.14
_NM
-0.14
lements
-0.14
ubi
-0.14
burg
-0.14
itra
-0.14
POSITIVE LOGITS
banana
0.42
Banana
0.39
bananas
0.39
banana
0.36
Ban
0.35
Ban
0.35
é¦Ļèķī
0.33
èķī
0.29
ban
0.29
ban
0.29
Activations Density 0.015%