INDEX
Explanations
words related to bears
references to "Bear" and related terms
New Auto-Interp
Negative Logits
istry
-0.81
selves
-0.74
lectic
-0.70
opus
-0.70
imates
-0.68
inel
-0.66
ubuntu
-0.66
ADRA
-0.66
ablishment
-0.66
isters
-0.66
POSITIVE LOGITS
bear
0.98
Gry
0.89
cub
0.89
beit
0.88
Bears
0.88
hug
0.87
Grizz
0.84
xual
0.84
cats
0.82
hugs
0.80
Activations Density 0.036%