INDEX
Explanations
references to "SN" followed by numbers representing specific entities or concepts
references to specific scientific terminology and abbreviations, particularly those related to biology and genetics
Negative Logits
cens                                -0.70
================================    -0.70
Lama                                -0.66
Templ                               -0.64
mum                                 -0.64
tons                                -0.63
erection                            -0.63
Emir                                -0.62
kson                                -0.61
tul                                 -0.61
Positive Logits
OW      0.98
SN      0.93
MP      0.78
BOOK    0.74
OV      0.74
ERG     0.73
VD      0.73
eele    0.72
SN      0.72
ASH     0.70
Activations Density 0.039%