INDEX
Explanations
structural and functional aspects related to biological processes and environmental factors
Negative Logits
unwanted     -0.16
ikip         -0.15
aris         -0.15
yna          -0.14
suppress     -0.14
isan         -0.14
ifiable      -0.13
induced      -0.13
loy          -0.13
lasting      -0.13
Positive Logits
underlying    0.40
responsible   0.40
driving       0.39
behind        0.36
Driving       0.32
Driving       0.30
Responsible   0.28
responsable   0.28
Behind        0.28
governing     0.27
Activations Density: 0.432%