INDEX
Explanations
Brazilian or Portuguese contexts
New Auto-Interp
Negative Logits
宾
0.57
Ջ
0.54
venerable
0.54
ግዳ
0.52
stalwart
0.51
odge
0.50
ඔ
0.50
Collingwood
0.49
强
0.49
ब्रिटेन
0.49
POSITIVE LOGITS
Brazilian
1.51
Brazilian
1.48
brazilian
1.48
Brasil
1.40
brasileiros
1.39
brasileiro
1.38
brasile
1.37
brasil
1.35
brasil
1.34
Brasil
1.34
Activations Density 0.082%