INDEX
Explanations
questions related to decision-making
conjunctions and phrases indicating relationships or connections
Negative Logits
FIELD        -0.62
ICT          -0.58
estones      -0.56
onia         -0.55
inqu         -0.55
onte         -0.54
aditional    -0.52
necks        -0.52
itual        -0.52
rite         -0.51
Positive Logits
how          1.80
why          1.75
whence       1.62
why          1.60
what         1.58
what         1.53
WHY          1.52
how          1.52
HOW          1.42
WHAT         1.40
Activations Density 0.394%
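
For readers unfamiliar with the density figure, the sketch below shows one common way such a number could be computed: the fraction of token positions on which the feature's activation is above zero. This is an assumption about the metric, not something the page states; the function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means the share of positions where the feature fires.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 1000 token positions, 4 of which activate the feature.
acts = [0.0] * 996 + [0.7, 1.2, 0.3, 2.5]
density = activation_density(acts)
print(f"{density:.3%}")  # prints 0.400%
```

Under this reading, a density of 0.394% would mean the feature fires on roughly 4 of every 1000 tokens sampled.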