INDEX
Explanations
questions starting with "What" or "How"
questions or prompts that invite further exploration or inquiry
New Auto-Interp
Negative Logits
rique
-0.64
unction
-0.64
udos
-0.63
Settlement
-0.63
cott
-0.62
¬¼
-0.62
minster
-0.62
wash
-0.61
oppers
-0.61
§
-0.60
POSITIVE LOGITS
namely
1.14
viz
0.92
whether
0.89
versus
0.87
etc
0.86
whether
0.85
realistically
0.84
how
0.84
besides
0.84
assuming
0.81
Activations Density 0.279%