INDEX
Explanations
references to pet characteristics and behavior
New Auto-Interp
Negative Logits
Sphere
-0.16
Tweet
-0.15
_ble
-0.14
nett
-0.14
-----------*/↵
-0.14
пÑĢоз
-0.14
Ness
-0.14
ÏĢε
-0.14
_fh
-0.14
Sphere
-0.13
POSITIVE LOGITS
foster
0.18
neut
0.18
reactive
0.17
Labs
0.17
ossier
0.17
gentle
0.17
crate
0.17
gent
0.16
gentleman
0.16
idal
0.15
Activations Density 0.024%