INDEX
Explanations
references to bananas
references to bananas and related fruit imagery
New Auto-Interp
Negative Logits
Lev
-0.87
stre
-0.86
Shed
-0.83
Lear
-0.80
quest
-0.73
Tem
-0.72
Eng
-0.71
Gil
-0.71
IRE
-0.71
Ward
-0.70
POSITIVE LOGITS
banana
3.58
bananas
3.29
Banana
2.73
mango
1.92
pineapple
1.70
strawberries
1.65
oranges
1.63
gorilla
1.57
strawberry
1.52
peach
1.42
Activations Density 0.027%