INDEX
Explanations
sentences about plans, arrangements, and decisions
contractions of "I", "we", and "are"
New Auto-Interp
Negative Logits
citiz
-0.74
odied
-0.67
ilial
-0.65
Kills
-0.63
violates
-0.62
Virus
-0.61
Difference
-0.61
distinction
-0.61
Fail
-0.60
Critics
-0.60
POSITIVE LOGITS
hoping
1.28
expecting
1.24
anticipating
1.23
hopeful
1.16
eyeing
1.16
confident
1.14
aiming
1.09
planning
1.09
preparing
1.08
gearing
1.08
Activations Density 0.222%