INDEX
Explanations
mentions and variations of the word "banana."
Negative Logits
fjspx      -0.53
giorgio    -0.48
Савезне    -0.47
}{*}{      -0.45
ertale     -0.45
*~         -0.45
oult       -0.44
/*         -0.44
:]:        -0.43
')";       -0.43
Positive Logits
Banana     1.20
banana     1.19
Banana     1.14
Bananas    1.03
Bananas    1.02
banana     1.00
bananas    0.98
banane     0.94
banan      0.84
🍌         0.73
Activations Density 0.002%