Explanations
personal viewpoints and reflections
contexts involving problematic behaviors or situations
Negative Logits
ascript     -0.76
iken        -0.70
DNA         -0.67
Target      -0.65
pent        -0.65
Template    -0.65
ieval       -0.61
dating      -0.59
cli         -0.58
cale        -0.58
Positive Logits
importantly  0.80
therein      0.75
cru          0.69
culmin       0.68
TRUMP        0.68
besides      0.64
GOODMAN      0.63
thereof      0.61
weary        0.61
coerc        0.60
Activations Density: 0.364%