Negative Logits
CRS              -0.75
WriteLiteral     -0.74
alz              -0.68
IDA              -0.68
Narciss          -0.68
]=="             -0.67
CRS              -0.66
PTS              -0.65
nass             -0.65
Verr             -0.63
Positive Logits
UK               1.14
Unido            1.06
հղումներ         0.93
britannien       0.92
UK               0.90
Visser           0.83
rrggbb           0.80
disambiguazione  0.80
CONSIN           0.78
kingdom          0.77
Activations Density 0.013%