INDEX
Explanations
references to philosophical concepts and literature
New Auto-Interp
Negative Logits
ÑĨеÑĢков
-0.15
acente
-0.15
ìĶ
-0.15
Anglic
-0.14
ÑĨеÑĢкви
-0.14
íά
-0.14
.px
-0.14
illisecond
-0.14
Fah
-0.14
church
-0.14
POSITIVE LOGITS
Republic
0.26
Republic
0.24
Sok
0.23
dialog
0.22
Plato
0.21
Soph
0.21
Athens
0.20
City
0.20
city
0.20
Cave
0.20
Activations Density 0.019%