INDEX
Explanations
references to Greek terms, particularly those related to concepts and classifications
New Auto-Interp
Negative Logits
ÏĦÏĮÏĥο
-0.17
EMPLARY
-0.16
ÏħÏĢάÏģÏĩοÏħν
-0.16
ðŁ
-0.15
ακÏĮ
-0.15
Ð®ÐĽ
-0.15
είÏĩαν
-0.15
ðŁ
-0.14
&apos
-0.14
ÏĮμÏīÏĤ
-0.14
POSITIVE LOGITS
γ
0.27
κ
0.27
δ
0.26
κ
0.26
ο
0.26
Ïĥ
0.26
α
0.26
ÏĦ
0.25
Îĺ
0.25
ν
0.25
Activations Density 0.128%