Explanations
terms related to animals and animal welfare
terms related to animal welfare and legal regulations concerning animals and agriculture
Negative Logits
Token         Logit
HK            -0.89
urat          -0.85
aughs         -0.80
ynthesis      -0.78
selves        -0.77
Kinnikuman    -0.76
JD            -0.73
ours          -0.72
é¾            -0.72
flies         -0.72
Positive Logits
Token            Logit
care             0.99
prevention       0.98
insurance        0.98
safety           0.97
endanger         0.95
welfare          0.94
protective       0.94
authorization    0.91
counseling       0.89
protections      0.89
Activations Density: 0.441%