INDEX
Explanations
references to the word "fish" and related terms
Negative Logits
Token     Logit
MSI       -0.69
Canter    -0.65
xus       -0.63
AMC       -0.63
Madison   -0.60
Machina   -0.59
rian      -0.58
FINE      -0.58
verend    -0.58
Citizens  -0.57
Positive Logits
Token     Logit
bowl      1.46
mong      1.28
hook      1.23
nets      1.20
tails     1.14
tail      1.13
meal      1.11
tailed    1.10
bone      1.05
tank      1.02
Activations Density: 0.048%