INDEX
Explanations
terms associated with research, disclosure, and scientific information
New Auto-Interp
Negative Logits
-0.78
in
-0.67
information
-0.67
im
-0.66
experience
-0.64
an
-0.63
is
-0.61
use
-0.61
to
-0.60
,
-0.60
POSITIVE LOGITS
myſelf
1.59
ſeveral
1.39
Monfieur
1.38
Reſ
1.37
Majefty
1.34
Eſ
1.33
feroit
1.32
whoſe
1.32
houſe
1.32
Jefus
1.30
Activations Density 0.323%