INDEX
Explanations
references to camping or outdoor activities
mentions of different camps
New Auto-Interp
Negative Logits
lihood
-0.77
pudding
-0.72
ctive
-0.71
regress
-0.65
nutrit
-0.65
wip
-0.65
magnification
-0.65
antioxid
-0.64
mathemat
-0.62
proble
-0.62
POSITIVE LOGITS
fires
1.19
grounds
1.15
ground
1.14
camp
1.12
fire
1.07
agna
0.97
erness
0.96
anas
0.94
agne
0.91
bell
0.89
Activations Density 0.026%