Explanations
references to thrift shopping or secondhand stores
Negative Logits
thia       -0.77
lar        -0.76
tics       -0.69
sei        -0.67
STEM       -0.66
cled       -0.65
dayName    -0.64
ORED       -0.64
ATOR       -0.62
Hanson     -0.61
Positive Logits
ifty            0.98
letcher         0.89
ift             0.88
itude           0.87
ieth            0.86
imore           0.83
inho            0.79
initely         0.79
unction         0.78
withstanding    0.75
Activations Density 0.059%