INDEX
Explanations
phrases indicating that it is time to take specific actions or make changes
Negative Logits
insula      -0.75
iership     -0.74
ournal      -0.74
riad        -0.73
76561       -0.71
pes         -0.70
perpend     -0.69
agonist     -0.68
utsu        -0.67
hemat       -0.66
Positive Logits
anew        0.77
Governments 0.74
recons      0.67
overdue     0.66
consuming   0.65
ful         0.64
llor        0.64
reconsider  0.62
ward        0.62
aliens      0.62
Activations Density: 0.023%