INDEX
Explanations
references to bison and buffalo
mammoth, bison, buffalo, rhino
New Auto-Interp
Negative Logits
InputModule
-0.39
Palt
-0.37
Bettina
-0.36
Rina
-0.36
providedIn
-0.36
Hentet
-0.36
__':
-0.36
latin
-0.36
Harry
-0.36
garn
-0.35
POSITIVE LOGITS
bison
1.60
Bison
1.50
buffalo
1.44
buffalo
1.23
Buffalo
1.03
Buffalo
1.02
bison
1.00
🦬
0.69
Grizzly
0.69
BUFF
0.65
Activations Density 0.004%