INDEX
Explanations
mentions of the word "Mic" or "microwave"
references to specific brands or models of microwaves
New Auto-Interp
Negative Logits
女
-0.75
stadiums
-0.67
blocking
-0.66
agra
-0.66
adders
-0.66
goddess
-0.65
puppies
-0.64
reddits
-0.63
Anfield
-0.63
021
-0.63
POSITIVE LOGITS
Mic
3.96
Mic
3.16
microwave
1.79
microw
1.71
MIC
1.52
mic
1.45
MIC
1.32
Mickey
1.30
mic
1.28
Micro
1.11
Activations Density 0.022%