I have a question. As we know, ML.NET is amazing framework for .NET, its doing a lot of things by "yourself" so sometimes its hard to get deep into inside.
I have dataset with 30 diffrent features. Im afraid of overfit, so im looking for the easiest way to delete not necessery ones.
For example, if i want to skip first column, can my Data.CS
can look like this?:
//skipped Column 0
[Column(ordinal: "1")]
public float RadiusMean;
[Column(ordinal: "2")]
public float TextureMean;
[Column(ordinal: "3")]
public float PerimeterMean;
I noticed, that we can do this by deleting columns from features;
pipeline.Add(new ColumnConcatenator(outputColumn: "Features",
"TextureMean",
"PerimeterMean",
"AreaMean",
//delete not necessery columns
And by this, we can improve our result. But if it works like "deleting" columns for training model?
Second question, if there is any faster way to make columns? Or maybe there is method in ML.NET to getting columns from dataset?