INDEX
Explanations
terms related to fractures or breaking
New Auto-Interp
Negative Logits
ufact
-0.70
kees
-0.65
mington
-0.65
PDATE
-0.64
VP
-0.63
racuse
-0.60
steen
-0.60
slideshow
-0.60
viron
-0.59
heit
-0.58
POSITIVE LOGITS
ract
1.34
ors
1.06
ory
0.91
iced
0.84
ic
0.84
aneous
0.82
ual
0.82
inct
0.82
ect
0.79
ional
0.79
Activations Density 0.005%