INDEX
Explanations
proper nouns related to people, places, and organizations
phrases that indicate actions or events occurring over time
Negative Logits
antage      -0.76
ologies     -0.73
assumes     -0.70
substrate   -0.70
consumes    -0.70
rates       -0.69
governs     -0.68
improves    -0.68
lies        -0.68
relates     -0.67
Positive Logits
EStream     0.68
looph       0.68
Borough     0.68
Stage       0.68
Kinn        0.67
psychiat    0.66
cffffcc     0.66
aback       0.65
last        0.64
wiret       0.64
Activations Density 2.370%