INDEX
Explanations
mentions of professions or job titles
references to various professional roles or occupations
New Auto-Interp
Negative Logits
co
-0.69
ston
-0.67
CO
-0.67
TI
-0.67
ces
-0.66
Territories
-0.65
TON
-0.64
Anything
-0.62
VI
-0.62
tein
-0.61
POSITIVE LOGITS
poons
0.91
paces
0.90
encount
0.86
heet
0.85
challeng
0.84
sugg
0.84
uggest
0.81
allege
0.81
ioned
0.80
learned
0.79
Activations Density 0.207%