Assessing the usefulness of online message board mining in automatic stock prediction systems
We provide evidence of the usefulness of exploiting online text data in stock prediction systems. We do this by mining a popular Argentinian stock message board and empirically answering two questions. First, is there information in the online stock message board useful for predicting stock returns?...
Guardado en:
Autores principales: | , |
---|---|
Formato: | JOUR |
Materias: | |
Acceso en línea: | http://hdl.handle.net/20.500.12110/paper_18777503_v19_n_p43_Galvez |
Aporte de: |
id |
todo:paper_18777503_v19_n_p43_Galvez |
---|---|
record_format |
dspace |
spelling |
todo:paper_18777503_v19_n_p43_Galvez2023-10-03T16:34:18Z Assessing the usefulness of online message board mining in automatic stock prediction systems Gálvez, R.H. Gravano, A. Latent semantic analysis Random forest Ridge regression Stock market Text mining Classification (of information) Costs Data mining Decision trees Electronic trading Financial markets Forecasting Investments Learning systems Regression analysis Semantics Classification system Latent Semantic Analysis Predictive information Predictive models Random forests Ridge regression Technical indicator Text mining Online systems We provide evidence of the usefulness of exploiting online text data in stock prediction systems. We do this by mining a popular Argentinian stock message board and empirically answering two questions. First, is there information in the online stock message board useful for predicting stock returns? Second, if useful information is found, is it novel or it is simply a different way of expressing information already available in the past behavior of stock prices? To address these questions, we build and validate a series of predictive models using state-of-the-art machine learning and topic discovery techniques. Running experiments in which the models are trained with different combinations of features extracted from the past behavior of stock prices, or mined from the online message boards. Evidence suggests that it is possible to extract predictive information from stock message boards. Furthermore, we find that adding this information improves the performance of classification systems trained solely on technical indicators. Our results suggest that information from online text data is complementary to the one available in the past evolution of stock prices. Additionally, we find that highly predictive features derived from the message board data seem to have an important and relevant semantic content. © 2017 Elsevier B.V. Fil:Gravano, A. Universidad de Buenos Aires. Facultad de Ciencias Exactas y Naturales; Argentina. JOUR info:eu-repo/semantics/openAccess http://creativecommons.org/licenses/by/2.5/ar http://hdl.handle.net/20.500.12110/paper_18777503_v19_n_p43_Galvez |
institution |
Universidad de Buenos Aires |
institution_str |
I-28 |
repository_str |
R-134 |
collection |
Biblioteca Digital - Facultad de Ciencias Exactas y Naturales (UBA) |
topic |
Latent semantic analysis Random forest Ridge regression Stock market Text mining Classification (of information) Costs Data mining Decision trees Electronic trading Financial markets Forecasting Investments Learning systems Regression analysis Semantics Classification system Latent Semantic Analysis Predictive information Predictive models Random forests Ridge regression Technical indicator Text mining Online systems |
spellingShingle |
Latent semantic analysis Random forest Ridge regression Stock market Text mining Classification (of information) Costs Data mining Decision trees Electronic trading Financial markets Forecasting Investments Learning systems Regression analysis Semantics Classification system Latent Semantic Analysis Predictive information Predictive models Random forests Ridge regression Technical indicator Text mining Online systems Gálvez, R.H. Gravano, A. Assessing the usefulness of online message board mining in automatic stock prediction systems |
topic_facet |
Latent semantic analysis Random forest Ridge regression Stock market Text mining Classification (of information) Costs Data mining Decision trees Electronic trading Financial markets Forecasting Investments Learning systems Regression analysis Semantics Classification system Latent Semantic Analysis Predictive information Predictive models Random forests Ridge regression Technical indicator Text mining Online systems |
description |
We provide evidence of the usefulness of exploiting online text data in stock prediction systems. We do this by mining a popular Argentinian stock message board and empirically answering two questions. First, is there information in the online stock message board useful for predicting stock returns? Second, if useful information is found, is it novel or it is simply a different way of expressing information already available in the past behavior of stock prices? To address these questions, we build and validate a series of predictive models using state-of-the-art machine learning and topic discovery techniques. Running experiments in which the models are trained with different combinations of features extracted from the past behavior of stock prices, or mined from the online message boards. Evidence suggests that it is possible to extract predictive information from stock message boards. Furthermore, we find that adding this information improves the performance of classification systems trained solely on technical indicators. Our results suggest that information from online text data is complementary to the one available in the past evolution of stock prices. Additionally, we find that highly predictive features derived from the message board data seem to have an important and relevant semantic content. © 2017 Elsevier B.V. |
format |
JOUR |
author |
Gálvez, R.H. Gravano, A. |
author_facet |
Gálvez, R.H. Gravano, A. |
author_sort |
Gálvez, R.H. |
title |
Assessing the usefulness of online message board mining in automatic stock prediction systems |
title_short |
Assessing the usefulness of online message board mining in automatic stock prediction systems |
title_full |
Assessing the usefulness of online message board mining in automatic stock prediction systems |
title_fullStr |
Assessing the usefulness of online message board mining in automatic stock prediction systems |
title_full_unstemmed |
Assessing the usefulness of online message board mining in automatic stock prediction systems |
title_sort |
assessing the usefulness of online message board mining in automatic stock prediction systems |
url |
http://hdl.handle.net/20.500.12110/paper_18777503_v19_n_p43_Galvez |
work_keys_str_mv |
AT galvezrh assessingtheusefulnessofonlinemessageboardmininginautomaticstockpredictionsystems AT gravanoa assessingtheusefulnessofonlinemessageboardmininginautomaticstockpredictionsystems |
_version_ |
1782027638080012288 |