INDEX
Explanations
mentions of medical conditions or symptoms
terms related to the concept of "out" or "being out," especially in various contexts
New Auto-Interp
Negative Logits
ha
-0.85
Gra
-0.82
leigh
-0.81
gra
-0.81
hei
-0.75
angle
-0.73
helic
-0.72
lia
-0.72
hene
-0.70
ŃĶ
-0.70
POSITIVE LOGITS
Out
1.99
Out
1.89
OUT
1.83
out
1.80
out
1.72
OUT
1.71
outs
1.51
outs
1.51
Outs
1.41
outed
1.15
Activations Density 0.110%