INDEX
Explanations
scientific or technical terms related to laws, patents, organizations, and emotional concepts like poisons
New Auto-Interp
Negative Logits
vik
-0.73
DERR
-0.72
entimes
-0.70
erb
-0.67
lf
-0.66
biz
-0.64
irrel
-0.63
pmwiki
-0.61
ternity
-0.61
illy
-0.60
POSITIVE LOGITS
respectively
1.15
finalists
1.00
listed
1.00
vying
0.99
composing
0.99
simultaneously
0.94
consecut
0.93
mentioned
0.90
combined
0.88
viz
0.87
Activations Density 0.297%