INDEX
Explanations
words related to animal behavior, specifically in experimental settings
New Auto-Interp
Negative Logits
RELEASE
-0.71
EMS
-0.69
)|
-0.65
iatus
-0.64
misunder
-0.63
.''
-0.61
ccording
-0.60
arrang
-0.60
.''.
-0.60
=================================================================
-0.59
POSITIVE LOGITS
secondly
0.76
Conversely
0.74
others
0.74
likewise
0.73
vice
0.69
Likewise
0.65
thouse
0.63
Others
0.63
Others
0.62
oliath
0.62
Activations Density 3.040%