INDEX
Explanations
specific named entities like "documents."
references to official records or files
New Auto-Interp
Negative Logits
twitch
-0.74
reciation
-0.71
pit
-0.66
ensity
-0.66
rossover
-0.65
obic
-0.65
alty
-0.65
=#
-0.64
chance
-0.63
olation
-0.63
POSITIVE LOGITS
documents
3.62
Documents
2.69
document
2.42
Documents
2.33
papers
2.22
documentation
1.86
docs
1.84
memos
1.84
records
1.80
Document
1.73
Activations Density 0.016%