Explanations
words related to trends or patterns
references to trends or patterns
Negative Logits
gur      -0.98
tein     -0.74
oath     -0.68
Lak      -0.65
hiro     -0.64
lungs    -0.63
idden    -0.63
barr     -0.61
oÄŁ      -0.61
Aires    -0.60
Positive Logits
setting  1.20
trend    1.05
Trend    1.03
line     0.91
Trend    0.90
trends   0.89
icity    0.83
set      0.83
Trends   0.82
lines    0.81
Activations Density 0.015%