INDEX
Explanations
phrases related to valuable or significant collections of information, particularly leaks
New Auto-Interp
Head Attr Weights
0:0.03
1:0.02
2:0.07
3:0.04
4:0.09
5:0.03
6:0.04
7:0.32
8:0.04
9:0.04
10:0.14
11:0.08
Negative Logits
SPA
-1.66
MODE
-1.56
commute
-1.55
tilt
-1.50
pronounced
-1.49
cancel
-1.48
curfew
-1.46
disappro
-1.44
superv
-1.44
harmon
-1.41
POSITIVE LOGITS
trove
2.64
archives
2.17
caches
2.08
cache
2.04
geries
1.92
unearthed
1.90
treasure
1.86
riches
1.86
documents
1.83
abilia
1.83
Activations Density 0.001%