Explanations
words and phrases related to being new or a beginner
Negative Logits
љи            -0.52
thog          -0.51
halve         -0.50
धान           -0.49
waarom        -0.47
labelledby    -0.47
urably        -0.46
respectively  -0.46
mít           -0.45
arsch         -0.45
Positive Logits
newcomer      1.03
newcomers     1.00
novice        0.89
beginner      0.89
NewLabel      0.88
newbie        0.87
newbies       0.86
دانشنامهٔ     0.82
Newly         0.82
rookies       0.81
Activations Density: 0.170%