INDEX
Explanations
specific phrases or elements related to historical or geographical contexts
New Auto-Interp
Negative Logits
clipse
-0.16
isko
-0.15
ponential
-0.15
IDD
-0.15
åħ±åĴĮ
-0.15
raj
-0.14
ITT
-0.14
inux
-0.14
urma
-0.14
ication
-0.14
POSITIVE LOGITS
Patri
0.36
Auto
0.29
auto
0.28
Ec
0.27
auto
0.27
patriarch
0.26
Auto
0.26
Barth
0.25
AUTO
0.23
Greek
0.23
Activations Density 0.036%