Explanations
ancient Greece, Rome, and Egypt
Negative Logits
ਕਰਨ            0.51
quela          0.49
<unused1818>   0.49
rebounds       0.48
ማድረግ           0.48
isa            0.48
зона           0.48
मधुमेह           0.48
thuận          0.47
оста           0.47
Positive Logits
Greek          0.91
griech         0.90
greek          0.84
Griechen       0.80
Greek          0.79
grec           0.74
Greece         0.73
ancient        0.67
griega         0.67
Greece         0.67
Activations Density: 0.281%