Explanations
dates and chronological references
Negative Logits
Token        Logit
OGND         -0.84
expandindo   -0.83
Infór        -0.76
rungsseite   -0.75
miniaturka   -0.75
Grüsse       -0.70
(blank)      -0.70
ligiloj      -0.70
<unused41>   -0.69
<unused43>   -0.69
Positive Logits
Token        Logit
Ny           0.56
March        0.55
march        0.52
_            0.47
St           0.44
Ein          0.44
Dec          0.43
*            0.43
ni           0.42
|            0.42
Activations Density: 0.470%