INDEX
NEGATIVE LOGITS
transt       0.41
twenty       0.40
culturally   0.38
вут          0.38
stomachs     0.38
Survey       0.38
двадцать     0.38
Survey       0.38
certific     0.38
verk         0.38
POSITIVE LOGITS
BUCK         0.40
쎈           0.37
fxml         0.37
েস্ক          0.36
ecta         0.36
ellas        0.35
በፊት          0.35
🍦           0.35
Xin          0.34
PanelVisual  0.34
ACTIVATIONS DENSITY
0.002%