Explanations
describing physical materials and objects
Negative Logits
잖아요  0.50
paediatric  0.49
bebés  0.48
samoglas  0.47
pediatric  0.45
neighbourhoods  0.45
quantified  0.44
寶寶  0.44
legislators  0.44
people  0.43
Positive Logits
Raven  0.61
obsidian  0.55
Iron  0.55
dimly  0.54
Bronze  0.52
weathered  0.51
crimson  0.51
прочем  0.51
meager  0.51
jagged  0.50
Activations Density: 0.030%