Explanations
words related to a specific TV show or franchise
mentions of the TV show "Game of Thrones."
Negative Logits
Token    Logit
ought    -0.66
iffe     -0.65
ROR      -0.62
ancies   -0.60
bapt     -0.59
orate    -0.58
maxim    -0.58
attery   -0.57
hips     -0.57
Eisen    -0.57
Positive Logits
Token    Logit
FAQ      1.13
Cube     1.11
Stop     1.11
zeb      1.06
Spot     1.05
cube     1.04
cock     1.01
Maker    0.95
boy      0.94
Freak    0.90
Activations Density: 0.033%