INDEX
Explanations
mentions of specific individuals or companies
phrases related to personal attributes or characteristics of individuals
New Auto-Interp
Negative Logits
agre
-0.65
ipel
-0.64
reth
-0.60
:(
-0.57
Canaan
-0.57
imester
-0.57
contend
-0.57
stockpile
-0.56
*:
-0.56
ramid
-0.55
POSITIVE LOGITS
pires
0.73
fol
0.70
azine
0.69
should
0.66
front
0.66
stood
0.62
Mom
0.60
Drive
0.60
Pros
0.59
pite
0.59
Activations Density 0.853%