INDEX
Explanations
phrases or terms related to journalism
Negative Logits
charg     -0.75
xual      -0.67
ranged    -0.67
inence    -0.66
hens      -0.64
ergy      -0.63
imus      -0.63
worth     -0.63
ifiers    -0.63
bodied    -0.61
Positive Logits
Journalism     1.00
Journalists    0.95
journalism     0.95
ethics         0.80
racuse         0.78
corps          0.77
journalist     0.76
Integrity      0.76
journalists    0.74
mbuds          0.74
Activations Density 0.033%