INDEX
Explanations
terms related to health conditions and their causal relationships
New Auto-Interp
Negative Logits
-0.60
can
-0.52
Anda
-0.50
super
-0.50
normal
-0.48
off
-0.47
dis
-0.47
é
-0.47
inter
-0.47
don
-0.46
POSITIVE LOGITS
Jefus
1.05
Efq
1.00
itſelf
0.97
Monfieur
0.93
myſelf
0.90
enfans
0.88
ſeveral
0.87
purpoſe
0.85
Majefty
0.85
themſelves
0.84
Activations Density 1.032%