INDEX
Explanations
references to organizations and titles of positions
New Auto-Interp
Negative Logits
forgetting
-0.76
enough
-0.69
inventoryQuantity
-0.64
Newsletter
-0.62
understatement
-0.62
Journal
-0.62
toile
-0.60
Waiting
-0.59
Hobbit
-0.58
Enough
-0.58
POSITIVE LOGITS
consists
1.05
consisted
1.02
originated
1.00
comprised
0.97
aims
0.96
basically
0.94
combines
0.94
consist
0.94
revolves
0.93
essentially
0.91
Activations Density 0.831%