INDEX
Explanations
Russian language characters and words
New Auto-Interp
Negative Logits
Franch
-0.71
theless
-0.67
essa
-0.67
ulators
-0.66
concede
-0.65
gad
-0.63
board
-0.62
Ada
-0.61
Bun
-0.61
venture
-0.61
POSITIVE LOGITS
Į
1.99
¢
1.90
¹
1.89
¥
1.87
²
1.86
±
1.85
´
1.85
Ń
1.84
Ķ
1.80
³
1.80
Activations Density 0.252%