INDEX
Explanations
phrases related to problem-solving and leadership
concepts related to truth and inevitability
New Auto-Interp
Negative Logits
withd
-0.62
Seym
-0.58
ãĤ¶
-0.57
anwhile
-0.57
Moroc
-0.55
escription
-0.55
76561
-0.54
senal
-0.52
subur
-0.52
Previously
-0.51
POSITIVE LOGITS
ain
0.80
yours
0.76
)!
0.72
deserve
0.67
damn
0.67
damned
0.64
!--
0.63
shit
0.63
)?
0.61
':
0.61
Activations Density 1.107%