INDEX
Explanations
scientific terms and institutions
proper nouns related to academic institutions and medical terms
New Auto-Interp
Negative Logits
à©
-0.53
nonex
-0.49
amen
-0.49
ãģĤ
-0.49
predic
-0.49
somet
-0.47
Instr
-0.47
Ire
-0.47
nodd
-0.46
unman
-0.46
POSITIVE LOGITS
,[
1.14
);
1.11
)?
1.10
),
1.09
.),
1.08
.)
1.05
);
1.04
,)
1.03
),
1.03
).[
1.02
Activations Density 0.606%