INDEX
Explanations
mentions of cats or similar words related to cats
references to cats or cat-related themes
New Auto-Interp
Negative Logits
OND
-0.71
FontSize
-0.70
hower
-0.66
mble
-0.66
Seym
-0.65
Fellowship
-0.63
ij士
-0.63
Impossible
-0.61
assetsadobe
-0.61
èĢħ
-0.61
POSITIVE LOGITS
aclysm
1.62
heter
1.38
apult
1.33
chers
1.30
cher
1.24
alogue
1.19
alyst
1.19
hedral
1.14
alog
1.14
fish
1.08
Activations Density 0.025%