INDEX
Explanations
terms related to health issues and associated risk factors
New Auto-Interp
Negative Logits
kasarigan
-1.25
Efq
-1.19
متعلقه
-1.18
ſeveral
-1.15
myſelf
-1.15
NUMX
-1.12
pleaſure
-1.11
purpoſe
-1.10
Majefty
-1.08
Jefus
-1.07
POSITIVE LOGITS
0.65
or
0.48
1
0.46
et
0.44
-
0.43
orced
0.43
+
0.43
The
0.42
and
0.41
<eos>
0.41
Activations Density 9.260%