Explanations
references to the movie or concept of "Avatar"
references to the "Avatar" franchise, particularly related to its characters and concepts
Negative Logits
Token      Logit
er         -0.77
meyer      -0.76
theless    -0.74
ancial     -0.72
feed       -0.72
esters     -0.71
fed        -0.71
laws       -0.69
ortun      -0.68
eners      -0.67
Positive Logits
Token      Logit
Korra      1.03
Roku       0.90
Avatar     0.84
uras       0.83
atar       0.71
riel       0.71
Haku       0.71
ura        0.67
Redditor   0.65
xual       0.65
Activations Density: 0.053%
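
The density figure is not defined here. Assuming it follows the usual convention for sparse feature dashboards (the share of sampled tokens on which the feature has a nonzero activation), it could be computed roughly as in the sketch below. The function name, the threshold parameter, and the sample data are hypothetical and are not taken from the source.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of tokens on which the feature is active.

    Assumption: "active" means the activation is strictly above `threshold`.
    """
    if not activations:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Hypothetical sample: 10,000 tokens, 5 of which activate the feature.
sample = [0.0] * 9995 + [1.2, 0.4, 3.1, 0.9, 2.2]
print(f"{activation_density(sample):.3f}%")  # prints 0.050%
```

Under that reading, a density of 0.053% would mean the feature fires on roughly 5 in every 10,000 sampled tokens.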