INDEX
Explanations
information related to current events and social media updates
Negative Logits
ĪĴ                  -0.78
lobb                -0.78
prevailing          -0.75
Elias               -0.74
nomine              -0.74
lim                 -0.74
possession          -0.72
utilities           -0.72
Romanian            -0.72
telecommunications  -0.72
Positive Logits
cdn                  1.18
                     0.93
journal              0.91
doi                  0.90
chrom                0.86
books                0.86
Appears              0.86
cp                   0.85
dp                   0.84
fff                  0.84
Activations Density 0.012%