INDEX
Explanations
instances of the word "rare" and phrases associated with rarity
New Auto-Interp
Head Attr Weights
0:0.01
1:0.01
2:0.07
3:0.08
4:0.11
5:0.03
6:0.06
7:0.37
8:0.02
9:0.02
10:0.08
11:0.08
Negative Logits
uliffe
-1.97
ohan
-1.55
spir
-1.45
jad
-1.45
xon
-1.45
ascript
-1.44
agall
-1.42
aintain
-1.36
ressing
-1.31
heartedly
-1.31
POSITIVE LOGITS
occurrence
1.75
Uncommon
1.70
Royale
1.53
Rare
1.51
Lot
1.48
arthed
1.41
Classic
1.41
Millennials
1.41
rare
1.38
Niet
1.33
Activations Density 0.014%