INDEX
Explanations
references to historical narratives and linguistic origins
Negative Logits
Alexand       -0.16
cubes         -0.15
èĩ¨           -0.14
Minh          -0.13
άÏĥ           -0.13
976           -0.13
tess          -0.13
úÄįast        -0.13
Alexandria    -0.13
intent        -0.13
Positive Logits
Alta          0.25
Proto         0.25
Indo          0.23
éģĬ           0.22
speakers      0.21
Proto         0.20
Turk          0.20
/proto        0.20
proto         0.19
PIE           0.19
Activations Density 0.033%