Questions tagged [amazon-machine-learning]

Amazon Machine Learning adds predictive abilities to application such as fraud detection, demand forecasting, targeted marketing, and click prediction. Algorithms create machine learning (ML) models by finding patterns in existing data. The service uses these models to process new data and generate predictions for applications.

58 questions
0
votes
1 answer

Amazon Machine Learning with Categorical Attributes - Limitations

I am using a regression model to predict numeric values from a set of 120 attributes. 7 of these attributes are Categorical, however the largest category has about 90,000 unique values. I am training with approximately 1 million rows of…
Sprooose
  • 504
  • 1
  • 6
  • 17
0
votes
1 answer

Key to boto3.client.get_batch_prediction() OutputURI

I'm trying to download the Output file from boto3.client.get_batch_prediction(). I figure the key is the filename bit of OutputURI so I'm doing client: response = batch.get_batch_prediction() if response['Status'] == 'COMPLETED': …
HenryM
  • 5,557
  • 7
  • 49
  • 105
0
votes
1 answer

Unable to use json.loads on boto3.client.get_batch_predictions()

I'm getting the following error when trying to parse a json response expected string or buffer Within my Django model I have the following: def get_batch_prediction(self): client = boto3.client('machinelearning', region_name=settings.region,…
HenryM
  • 5,557
  • 7
  • 49
  • 105
0
votes
1 answer

If I run a prediction against a model with multiple categories, are the scores split amongst all categories?

Let's say I have created a model with ~30 items for each of 10 categories. I've taken all of the defaults that were provided to me. The Average F1 Score for the model is 0.875 (I have 2 categories that are very closely related, so that's hurting…
James Chevalier
  • 10,604
  • 5
  • 48
  • 74
0
votes
1 answer

Cretae Datasourse machine learning AWS through SDK

I am trying to create a Datasource using AWS Machine learning SDK. At first i'm upload csv data file to S3 storage also throgh the SDK. Then i update bucket policy for this file. Only after all i'm trying create datasource but i receive error…
0
votes
1 answer

Amazon Machine Learning Studio: Adjusted Score threshold is not applied while making real time predictions

I am using Amazon Machine Learning Studio's binary classification algorithm, wherein I have adjusted my score threshold to 0.01 as per my requirement. Thus, all the records for which the prediction score is above 0.01 should predict 1. However,…
Vasanti
  • 1,207
  • 2
  • 12
  • 24
0
votes
1 answer

Create datasource and ML models periodically in Amazon Machine Learning

I have created a data source and trained the machine learning model in Amazon Machine Learning. The data resides in S3 which is used for creating the data source. However, my application has new data added to S3 every second, thus I need a way in…
Vasanti
  • 1,207
  • 2
  • 12
  • 24
0
votes
1 answer

AWS Machine Learning RealTimePredictor returns UnknownoperationException in C#

Using Visual Studio, and AWS .NET V 3.0. I'm trying to perform a real-time Predict operation, and to verify the basic setup works, I first perform a GetMLModel() which works and returns the endpoint (Somewhere in the documentation is was mentioned…
Andy
  • 842
  • 1
  • 12
  • 24
0
votes
0 answers

aws boto3 attributes 100% missing values

When I create a datasource via the python script below, at least one of my attributes has 100% missing values. When I manually create the datasource via the AWS ML dashboard, and apply the same attribute types, none of the values are missing. Is…
0
votes
2 answers

How to interpret performance results of AWS Machine Learning Service?

I am working on a pilot with Amazon Web Service Machine Learning service and I have some soubts. I have used a Binary Classifier model and, in my opinion, the histogram of the results obtained does not match the numerical results. According to the…
-1
votes
1 answer

Best practice to deploy multi models that will run concurrently at scale (something like map reduce)

I have a model that consists 150 models (runs in for loop). In order to be performance oriented, I would like to split it into 150 models, that for every request my server gets it will send 150 api requests to every different model and then combine…
-1
votes
2 answers

Can SageMaker distributed training be used for training non-deep learning models?

I am following this documentation page to understand SageMaker's distributed training feature. It says here that:-  The SageMaker distributed training libraries are available only through the AWS deep learning containers for the TensorFlow,…
-3
votes
1 answer

Which machine learning algorithm to evaluate the best combinations of groups?

Here's a simplified version of the problem: You have a group of five warriors. Each warrior owns between 1 and 10 items, and each can bring 1 of their items to a battle. Given information about the battle (terrain type, time of day, etc.), which…
Eli Brown
  • 63
  • 1
  • 4
1 2 3
4