INDEX
Explanations
references to animals and their characteristics or actions
Negative Logits
oversight             -0.87
appropriations        -0.85
Oversight             -0.75
affirmative           -0.75
quickShipAvailable    -0.71
Disclosure            -0.71
entitle               -0.68
icit                  -0.67
ーティ                -0.67
implementation        -0.67
Positive Logits
ensis                 0.96
nests                 0.88
vae                   0.85
birds                 0.80
fungus                0.80
reptiles              0.79
auri                  0.78
worms                 0.77
nib                   0.77
larvae                0.76
Activations Density 0.946%