INDEX
Explanations
mentions of objects or locations being equipped with certain features or amenities
Negative Logits
rase         -0.65
cott         -0.64
女           -0.60
clamation    -0.60
asta         -0.59
amin         -0.59
birth        -0.58
aji          -0.58
é»Ĵ          -0.57
Mub          -0.57
Positive Logits
fitted       0.86
equipped     0.81
bodied       0.78
bod          0.77
equipped     0.77
itted        0.75
fitted       0.74
oise         0.73
ioned        0.71
icient       0.71
Activations Density 0.049%
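
The page does not say how the density figure is computed. A common reading is the fraction of sampled tokens on which the feature has a non-zero activation. The sketch below illustrates that reading only; the function name, the threshold, and the sample data are hypothetical and do not come from this page.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the share of sampled tokens on which the
    feature fires. The dashboard itself does not confirm this definition.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 10,000 tokens, of which 5 activate the feature.
sample = [0.0] * 9995 + [1.2, 0.4, 3.1, 0.7, 2.2]
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.050%
```

Under this reading, a density of 0.049% would mean the feature fires on roughly 5 tokens in every 10,000 sampled.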