INDEX
Explanations
mentions of the word "penguins"
references to penguins
New Auto-Interp
Negative Logits
Interstitial
-0.82
nda
-0.78
¿½
-0.77
ysis
-0.71
phas
-0.68
nces
-0.67
nea
-0.66
guyen
-0.65
puter
-0.65
ameda
-0.64
POSITIVE LOGITS
Penguins
1.17
insula
0.96
DragonMagazine
0.94
pengu
0.87
Hots
0.83
éĹĺ
0.80
Pengu
0.77
ozo
0.77
Sharks
0.76
sburg
0.74
Activations Density 0.011%