INDEX
Explanations
negative situations or actions
punctuations and symbols indicating section breaks or end of thought
New Auto-Interp
Negative Logits
slump
-0.72
ĪĴ
-0.68
exhib
-0.67
undermin
-0.67
princ
-0.66
tenancy
-0.66
isot
-0.65
administ
-0.65
exha
-0.65
exhibitions
-0.65
POSITIVE LOGITS
<|endoftext|>
1.30
↵↵
1.19
↵
1.13
********************************
1.07
Anyway
0.99
âĶľâĶĢâĶĢ
0.94
Originally
0.92
[/
0.91
↵Âł
0.89
||||
0.89
Activations Density 0.113%