I have seen that in machine learning, the terms "feature" and "label" are used to refer to what I think of as "independent variable" and "dependent variable", respectively (more synonyms from Wikipedia).
The Wikipedia page describing the term "feature" appears to describe independent variables. This discussion also seems to support the idea that the two sets of terms are equivalent.
I would like to know if the terms are equivalent and can be used interchangeably. If they are not, what is the difference?
Historical background of the terms would be especially welcome.