INDEX
Explanations
legitimacy and authenticity
New Auto-Interp
Negative Logits
potrebbe
0.42
खूप
0.41
很大
0.41
preferencias
0.40
uitgebre
0.40
できるよう
0.40
prosperity
0.40
putern
0.39
可能会
0.39
thậm
0.39
POSITIVE LOGITS
legít
0.90
legitimate
0.88
genuine
0.78
legitimately
0.77
officially
0.76
genuine
0.76
lawful
0.73
lawfully
0.72
authentic
0.72
合法
0.70
Activations Density 0.806%