INDEX
Explanations
references to specific animal and plant species
New Auto-Interp
Negative Logits
rud
-0.15
acock
-0.14
alex
-0.14
Alexand
-0.14
FUNC
-0.14
arus
-0.14
ÏĢα
-0.14
vet
-0.14
Sink
-0.14
quir
-0.13
POSITIVE LOGITS
found
0.21
found
0.20
Found
0.19
Found
0.18
widespread
0.17
natives
0.17
native
0.17
encontr
0.16
à¸ŀà¸ļ
0.16
IME
0.15
Activations Density 0.095%