INDEX
Explanations
proper nouns and specific references in various contexts
Negative Logits
انيف            -0.60
spesies         -0.56
beta            -0.53
Beta            -0.53
LoggerFactory   -0.52
IRUS            -0.51
β               -0.51
verwijspagina   -0.49
Frankel         -0.49
ErrIntOverflow  -0.48
Positive Logits
Roast    0.84
Roberta  0.81
ROB      0.80
Rovers   0.77
ro       0.76
Robe     0.76
ro       0.76
Rosalie  0.75
Ro       0.73
RO       0.73
Activations Density: 3.176%