INDEX
Explanations
teaching and promoting education
New Auto-Interp
Negative Logits
ellä
0.41
বগু
0.39
έλ
0.38
iąz
0.37
ogen
0.37
oloč
0.37
numele
0.37
స్ప
0.36
names
0.36
emerald
0.35
POSITIVE LOGITS
التعليم
0.45
हैप्पी
0.42
طا
0.41
promoting
0.40
ת
0.39
fighting
0.39
Educação
0.39
تعالی
0.39
defying
0.38
lecturing
0.38
Activations Density 0.001%