INDEX
Explanations
mentions of the word "hotel"
instances of the word "hot" in various contexts
New Auto-Interp
Negative Logits
convol
-0.67
anke
-0.67
ISION
-0.66
orrow
-0.66
Commando
-0.65
literacy
-0.64
uthor
-0.64
Fargo
-0.64
Kinn
-0.63
Borders
-0.63
POSITIVE LOGITS
hot
0.88
ter
0.82
rop
0.82
assium
0.80
tery
0.79
ographed
0.78
shot
0.77
rod
0.76
eers
0.76
butt
0.76
Activations Density 0.009%