INDEX
Explanations
references to camps or camping activities
New Auto-Interp
Negative Logits
imore
-0.19
anced
-0.17
oldemort
-0.16
ancer
-0.16
imal
-0.16
annis
-0.16
annes
-0.16
zsche
-0.15
lijke
-0.15
lep
-0.15
POSITIVE LOGITS
site
0.37
grounds
0.31
fires
0.29
fire
0.27
ground
0.25
agne
0.23
bell
0.23
aigned
0.23
ers
0.23
erv
0.23
Activations Density 0.010%