INDEX
Explanations
mentions of fish-related content
New Auto-Interp
Negative Logits
myſelf
-1.01
itſelf
-0.96
poffible
-0.93
greateſt
-0.90
Bly
-0.89
universitarios
-0.85
purpoſe
-0.85
maxSize
-0.84
themſelves
-0.83
Jefus
-0.83
POSITIVE LOGITS
fish
2.19
Fish
2.03
FISH
1.86
Fish
1.81
fish
1.79
FISH
1.59
fishes
1.56
Fisch
1.41
Fishes
1.31
鱼
1.23
Activations Density 0.031%