INDEX
Explanations
references to significant observations and experiences of historical figures, particularly in the context of exploration and science
New Auto-Interp
Negative Logits
dumping
-0.15
elah
-0.15
jur
-0.15
rupa
-0.15
hiba
-0.15
Dump
-0.14
dumps
-0.14
816
-0.14
ĸ
-0.14
ок
-0.14
POSITIVE LOGITS
Gal
0.32
Gal
0.26
Darwin
0.25
GAL
0.23
tort
0.23
gal
0.22
Ecuador
0.20
Tort
0.20
Naz
0.19
TORT
0.18
Activations Density 0.014%