INDEX
Explanations
book titles and academic subjects
Negative Logits
potentially    0.93
healthcare     0.87
a              0.86
deep           0.85
IS             0.84
societal       0.79
resilience     0.76
mindset        0.76
the            0.75
real           0.75
Positive Logits
<unused2189>   1.22
]|             1.16
𒂮              1.13
<unused290>    1.02
<unused1882>   1.02
𒐸              1.02
<unused2089>   1.02
<unused2115>   1.01
<unused305>    1.00
<unused203>    0.99
Activations Density: 0.007%