Explanations
important events or decisions
significant announcements or changes in context
Negative Logits
Pros          -0.68
2600          -0.60
training      -0.60
gom           -0.58
Narr          -0.58
âĹ¼           -0.57
layer         -0.56
dying         -0.56
invincible    -0.55
ocrates       -0.54
Positive Logits
coincides      1.36
coincided      1.31
underscores    1.24
signifies      1.20
comes          1.17
reflects       1.15
reinforces     1.13
brings         1.13
represents     1.11
prompted       1.10
Activations Density 0.238%