INDEX
Explanations
references to various levels of participation or implication in certain actions or events
references to involvement in various activities or events
New Auto-Interp
Negative Logits
sell
-0.71
Balanced
-0.70
Jet
-0.70
Float
-0.69
\\\\\\\\
-0.64
Rapt
-0.62
imb
-0.61
Lev
-0.61
Leon
-0.61
Spa
-0.60
POSITIVE LOGITS
enza
0.91
involvement
0.88
annel
0.82
ioned
0.81
atform
0.76
itaire
0.74
oice
0.73
iveness
0.73
rity
0.72
hips
0.71
Activations Density 0.016%