INDEX
Explanations
phrases related to beginner-level experiences and learning
New Auto-Interp
Negative Logits
resourceCulture
-0.69
'\\;'
-0.69
DockStyle
-0.62
ostavi
-0.62
kasarigan
-0.60
newswire
-0.59
httphttps
-0.58
timewa
-0.57
urlopen
-0.57
حياته
-0.56
POSITIVE LOGITS
skill
0.95
skill
0.86
skills
0.77
Skill
0.76
unskilled
0.74
beginner
0.74
proficiency
0.73
novice
0.73
skills
0.72
Beginners
0.72
Activations Density 0.246%