Explanations
words related to home improvement tasks and appliances
references to household items and appliances
Negative Logits
Token           Logit
Submission      -0.72
ERO             -0.67
д               -0.66
BTC             -0.66
Reconstruction  -0.65
Æ               -0.65
Haram           -0.62
lez             -0.62
ICT             -0.61
LAT             -0.61
Positive Logits
Token           Logit
cloth           1.02
aptop           0.96
idges           0.89
mattress        0.88
tops            0.86
wagen           0.78
humid           0.78
sofa            0.77
washer          0.77
ampoo           0.77
Activations Density: 0.284%