INDEX
Explanations
phrases indicating possible contact information or references to external sources
New Auto-Interp
Head Attr Weights
0:0.08
1:0.08
2:0.07
3:0.08
4:0.07
5:0.08
6:0.08
7:0.07
8:0.08
9:0.09
10:0.08
11:0.09
Negative Logits
Carnival
-2.74
DX
-2.70
Syndicate
-2.67
Mania
-2.56
Magic
-2.56
subreddit
-2.50
Rune
-2.47
mods
-2.47
Minor
-2.39
Ethiop
-2.35
POSITIVE LOGITS
Cath
2.65
eur
2.64
lyak
2.58
Coul
2.54
Cath
2.51
lon
2.49
wl
2.47
ipal
2.44
imeter
2.42
……………………
2.42
Activations Density 0.000%