INDEX
Explanations
phrases related to decisions or actions taken by individuals or groups
references to significant decisions or actions that have important implications
Negative Logits
magazines   -0.75
models      -0.66
watches     -0.63
coats       -0.62
inhabit     -0.61
orbits      -0.60
loves       -0.60
Element     -0.60
Candy       -0.59
sizes       -0.59
Positive Logits
moot            0.92
deterrence      0.82
AFTA            0.77
ĵĺ              0.75
aimed           0.74
vind            0.73
jeopard         0.71
imminent        0.70
retrospective   0.70
timed           0.69
Activations Density: 0.591%