INDEX
Explanations
strategic defense facilities
New Auto-Interp
Negative Logits
ug
0.51
watering
0.51
တော့
0.49
ਰ
0.48
iberg
0.47
ivg
0.47
ьогодні
0.46
अगेन
0.46
těchto
0.46
ప్రదేశ్
0.46
POSITIVE LOGITS
enrich
0.42
守
0.42
completa
0.42
incomple
0.42
completeness
0.42
passing
0.41
complaint
0.41
סים
0.41
specifica
0.40
。
0.40
Activations Density 0.007%