INDEX
Explanations
proposals or important decisions
keywords related to proposals, decisions, and significant actions or narratives
New Auto-Interp
Negative Logits
tics
-0.76
itudes
-0.73
ses
-0.70
Machines
-0.68
Cups
-0.67
etsk
-0.65
akia
-0.65
hett
-0.65
Occupations
-0.64
olds
-0.62
POSITIVE LOGITS
atical
0.82
standpoint
0.72
assian
0.69
akin
0.68
dylib
0.66
called
0.65
nonetheless
0.65
earance
0.64
ulent
0.64
Timeout
0.63
Activations Density 0.553%