INDEX
Explanations
phrases related to detailed descriptions of past events or actions
Negative Logits
cation      -0.86
terness     -0.79
uncture     -0.78
iliation    -0.77
icism       -0.75
atform      -0.74
fecture     -0.72
position    -0.72
heit        -0.72
mire        -0.72
Positive Logits
kinds       1.10
sorts       1.05
facts       1.04
truths      1.04
types       0.99
types       0.96
days        0.94
cases       0.94
qualities   0.93
fellows     0.92
Activations Density 2.690%