INDEX
Explanations
specific scientific or technical terms related to human and animal biology
New Auto-Interp
Negative Logits
human
-0.73
Human
-0.69
human
-0.68
humana
-0.68
humanas
-0.63
menschlichen
-0.63
humanos
-0.62
HUMAN
-0.62
HUMAN
-0.61
menschliche
-0.61
POSITIVE LOGITS
تضيفلها
0.51
0.50
EndContext
0.46
SBATCH
0.44
Geplaatst
0.44
colnshire
0.43
PLANATION
0.43
familiar
0.40
familiar
0.40
脚注の使い方
0.40
Activations Density 0.027%