INDEX
Explanations
words related to specific named entities or terms, such as "Chipotle", "biotin", and "bigot"
words related to specific types of food or dietary components
New Auto-Interp
Negative Logits
confir
-0.69
exha
-0.63
desperate
-0.63
nesota
-0.62
continuum
-0.61
inges
-0.60
vow
-0.59
cohorts
-0.59
sentence
-0.59
thirst
-0.59
POSITIVE LOGITS
cot
1.42
chio
1.23
arget
1.06
otle
1.02
olkien
0.97
rans
0.94
pole
0.92
uning
0.91
agon
0.91
oxin
0.89
Activations Density 0.006%