INDEX
Explanations
occurrences of words related to intervention and interference in various contexts
New Auto-Interp
Negative Logits
living
-0.52
Rept
-0.52
zeich
-0.51
Living
-0.50
Living
-0.49
лъ
-0.47
cipital
-0.47
-0.46
living
-0.46
gallows
-0.46
POSITIVE LOGITS
intervene
1.15
intervened
1.12
intervening
1.01
intervenir
0.98
intervention
0.95
meddling
0.92
Intervention
0.90
Interventions
0.88
interven
0.87
intervi
0.86
Activations Density 0.063%