INDEX
Explanations
terms related to vital support or essential components of life
New Auto-Interp
Negative Logits
icot
-0.18
canf
-0.18
ibold
-0.17
modne
-0.16
/******/
-0.16
ubat
-0.16
.dds
-0.16
Ware
-0.15
odense
-0.15
znik
-0.14
POSITIVE LOGITS
con
0.18
0.16
ed
0.15
Multip
0.15
ember
0.15
US
0.15
ilo
0.15
Syn
0.15
l
0.15
yn
0.15
Activations Density 0.004%