INDEX
Explanations
phrases related to discussing, proposing, or evaluating ideas or plans
references to significant ideas or concepts
New Auto-Interp
Negative Logits
record
-0.80
audits
-0.78
Coverage
-0.76
essional
-0.75
penalties
-0.73
adelphia
-0.72
Penalty
-0.71
Records
-0.70
reporting
-0.70
LIA
-0.69
POSITIVE LOGITS
floated
1.18
hatched
1.17
appealing
0.98
revived
0.97
gaining
0.95
embraced
0.95
entertained
0.95
prevalent
0.94
debunked
0.94
concept
0.94
Activations Density 0.245%