INDEX
Explanations
references to scientific theories and their historical development
New Auto-Interp
Negative Logits
çIJ³
-0.17
uide
-0.16
ensburg
-0.14
ourcing
-0.14
Bauer
-0.14
Seasons
-0.14
istrat
-0.14
reator
-0.13
elize
-0.13
Pending
-0.13
POSITIVE LOGITS
196
0.22
chor
0.18
197
0.17
195
0.17
original
0.16
originally
0.16
ory
0.16
iya
0.15
194
0.15
198
0.15
Activations Density 0.074%