• Wie zijn wij
    • Wie zijn wij?
    • Ons team
    • Werken bij
    • Case Studies
  • Wat doen we
    • Wat doen we
    • Strategie en team
    • Implementatie
    • Trainingen
    • Support en beheer
    • Detachering
    • Community
  • Technologie
    • Tableau
    • Alteryx
    • Snowflake
    • Matillion
    • Fivetran
  • Evenementen
  • Blogs
  • Contact
Haaksbergweg 75, 1101 BR Amsterdam
info@theinformationlab.nl
020 261 4741
    • Wie zijn wij
      • Wie zijn wij?
      • Ons team
      • Werken bij
      • Case Studies
    • Wat doen we
      • Wat doen we
      • Strategie en team
      • Implementatie
      • Trainingen
      • Support en beheer
      • Detachering
      • Community
    • Technologie
      • Tableau
      • Alteryx
      • Snowflake
      • Matillion
      • Fivetran
    • Evenementen
    • Blogs
    • Contact
The Information Lab Nederland
The Information Lab Nederland
Haaksbergweg 75, 1101 BR Amsterdam
info@theinformationlab.nl
020 261 4741
The Information Lab Nederland
The Information Lab Nederland
The Information Lab Nederland
  • Wie zijn wij
    • Wie zijn wij?
    • Ons team
    • Werken bij
    • Case Studies
  • Wat doen we
    • Wat doen we
    • Strategie en team
    • Implementatie
    • Trainingen
    • Support en beheer
    • Detachering
    • Community
  • Technologie
    • Tableau
    • Alteryx
    • Snowflake
    • Matillion
    • Fivetran
  • Evenementen
  • Blogs
  • Contact

Using your data to predict the future! Part 2

The Information Lab Nederland > Blog > Alle > Using your data to predict the future! Part 2
  • 9 juni 2023
  • Ben Holland
  • Alle
  • 0

Which Algorithm?

Choosing the right predictive tool in Alteryx Designer can be challenging, as there are many different options available and each one has its own strengths and weaknesses. To make the best choice, it’s important to consider the specific needs of your project and the characteristics of the data you are working with. Some factors to consider when choosing a predictive tool include:

 The type of prediction you want to make: Some tools are better suited to making predictions about continuous values, while others are better at predicting discrete classes. Consider the type of outcome you want to predict and choose a tool that is designed for that type of prediction.

  1. The complexity of the data: Some predictive tools are better suited to handling complex, high-dimensional data, while others are better at making predictions based on simple, low-dimensional data. Consider the characteristics of your data and choose a tool that is able to effectively handle the complexity of your data. 
  2. The linearity of the data can affect which machine learning algorithm you should choose in a few different ways.
    1. First, linear algorithms generally perform better on linearly separable data, meaning data that can be separated into different classes or categories by a single straight line. If your data is linearly separable, you may want to consider using a linear model such as logistic regression or linear regression.
    2. Second, non-linear algorithms can often handle more complex, non-linear data better than linear algorithms. If your data is not linearly separable, or if it has a more complex structure, you may want to consider using a non-linear model such as a decision tree, random forest, or a neural network.

By considering these, you can choose the right predictive tool in Alteryx Designer for your project and get the best possible results. 

So, what did I use?

In my project I used to criteria outline above to look at the effectiveness of the different linear algorithms. In the end I selected to look at linear regression, support vector machine (SVM) regression, and count regression. These are all types of regression algorithms, which are used to predict a continuous numeric value based on one or more input variables. In my project, I was looking to predict the number of crashes based on multiple weather variables.

Linear Regression

Linear regression is a simple and widely used technique for modelling the relationship between a dependent variable and one or more independent variables. In linear regression, the model assumes that the relationship between the dependent and independent variables is linear, meaning that the model can be represented by a straight line.

Support Vector Machine (SVM)

Support vector machine (SVM) regression is a type of non-linear regression that uses a different approach to model the relationship between the dependent and independent variables. In SVM regression, the model tries to find the line or hyperplane that maximises the margin between the data points of different classes. This can make SVM regression more effective at handling complex, non-linear data than linear regression.

Count Regression

A count regression is a type of regression that is used to model count data, which is data that represents the number of occurrences of some event (such as the number of clicks on a website or the number of purchases made by a customer). These models are typically used when the dependent variable is a count or an integer value, and the model is used to predict the number of occurrences of some event based on one or more input variables.

In Alteryx Designer you can use these algorithms as individual tools. These tools can be trained and tested by connecting your data that has been cleaned and prepared into the correct format (see part 1 of the series: https://theinformationlab.nl/en/2023/02/10/using-your-data-to-predict-the-future-2/). Each of these tools has specific settings and parameters that you can use to customise the behaviour of the model, such as the type of regularisation to use or the kernel function to use in the case of SVM regression. You can then use the output from these tools to make predictions on new data, or to evaluate the performance of the trained model.

Next time…

Having figured out exactly which tools and algorithms we should be using the next part of methodology is to assess them and determine which is most suitable for the final product. Next time, we will be doing exactly that by using the Score tool. See you then!


Thank you for reading this blog. Also check out our other blogs page to view more blogs on Tableau, Alteryx, and Snowflake here.

Work together  with one of our consultants and maximise the effects of your data. 

Contact us , and we’ll help you right away.

Consultancy

Werk samen met een van onze Consultants en haal het maximale uit jouw data!

Neem contact op met ons om direct hulp te krijgen.

T: +31 20 261 4741
E: info@theinformationlab.nl

Inschrijven nieuwsbrief

Categorieën

  • Tableau
  • Alteryx
  • Snowflake
  • Matillion
  • Fivetran
  • DBT
  • Overige

Recente blog artikelen

  • The Book Club Ep. 4
  • Met cumulatieve waarden werken in Power BI: DATESINPERIOD functie (1)
  • New features on Tableau 2023.3
  • How to take your work to the next level
  • 6 Key Takeaways on How to Talk about Mental Health

The Information Lab

Bij The Information Lab zetten we ons in om bedrijven data gedreven te laten werken. Van onderbuikgevoel naar onderbouwde beslissingen.

Wij zijn betrokken bij alle aspecten van dit proces, van het verstrekken van jouw eerste licentie tot het helpen bij de uitrol binnen de gehele organisatie.

Contactgegevens

  • Haaksbergweg 75, 1101 BR Amsterdam
  • 020 261 4741
  • info@theinformationlab.nl
  • www.theinformationlab.nl

© 2022 Alle rechten voorbehouden.

  • Algemene voorwaarden
  • Cookiebeleid
  • Disclaimer
  • Privacy
We gebruiken cookies om nuttige functies aan te bieden en de prestaties te meten om uw ervaring te verbeteren. Door te klikken op “Cookie instellingen" ga je alleen akkoord met de categorieën die je hebt geselecteerd.
Cookie instellingenAlles accepteren
Manage consent

Cookie overzicht

Deze website maakt gebruik van cookies om uw ervaring te verbeteren terwijl u door de website navigeert. Van deze cookies worden de cookies die als noodzakelijk zijn gecategoriseerd, in uw browser opgeslagen omdat ze essentieel zijn voor de werking van de basisfunctionaliteiten van de website. Wij gebruiken ook cookies van derden die ons helpen te analyseren en begrijpen hoe u deze website gebruikt. Deze cookies worden alleen met uw toestemming in uw browser opgeslagen. U hebt ook de mogelijkheid om u af te melden voor deze cookies. Maar het uitschakelen van sommige van deze cookies kan uw browse-ervaring beïnvloeden.
Necessary
Altijd ingeschakeld
Noodzakelijke cookies zijn absoluut noodzakelijk om de website goed te laten functioneren. Deze cookies zorgen voor basisfunctionaliteiten en beveiligingsfuncties van de website, anoniem.
CookieDuurOmschrijving
cookielawinfo-checkbox-analytics11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional11 monthsThe cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy11 monthsThe cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.
Functional
Functionele cookies helpen om bepaalde functionaliteiten uit te voeren, zoals het delen van de inhoud van de website op social media platforms, het verzamelen van feedback en andere functies van derden.
Performance
Prestatiecookies worden gebruikt om de belangrijkste prestatie-indexen van de website te begrijpen en te analyseren, wat helpt bij het leveren van een betere gebruikerservaring voor de bezoekers.
Analytics
Analytische cookies worden gebruikt om te begrijpen hoe bezoekers omgaan met de website. Deze cookies helpen informatie te verstrekken over het aantal bezoekers, het bouncepercentage, de verkeersbron, enz.
Advertisement
Advertentiecookies worden gebruikt om bezoekers te voorzien van relevante advertenties en marketingcampagnes. Deze cookies volgen bezoekers op verschillende websites en verzamelen informatie om advertenties op maat aan te bieden.
Others
Andere ongecategoriseerde cookies zijn cookies die worden geanalyseerd en nog niet in een categorie zijn ondergebracht.
OPSLAAN & ACCEPTEREN