INDEX
Explanations
mentions of writing or coding-related activities
instances of writing and related activities
Negative Logits
osate      -0.93
terness    -0.89
yip        -0.88
76561      -0.85
ESA        -0.79
ordering   -0.79
ridor      -0.78
ĸļ         -0.77
pling      -0.74
ENTION     -0.70
Positive Logits
professionally   1.12
weddings         1.02
comics           0.96
documentaries    0.95
podcasts         0.91
novels           0.91
music            0.88
videog           0.86
movies           0.85
mysteries        0.83
Activations Density: 0.363%