INDEX
Explanations
instances where the text discusses the implications or consequences of certain actions or decisions
references to the word "it" often in contexts suggesting discussion of a subject or object
New Auto-Interp
Negative Logits
UGH
-0.71
OME
-0.69
IFE
-0.66
Finish
-0.64
Hope
-0.64
FUL
-0.62
Flight
-0.61
Magikarp
-0.61
Genius
-0.61
hift
-0.61
POSITIVE LOGITS
involves
1.33
relates
1.32
contradicts
1.28
coincides
1.23
represents
1.22
embodies
1.18
violates
1.18
contains
1.17
resembles
1.16
reflects
1.15
Activations Density 0.216%