1 Why Pay Attention to Stock Message Boards? 2 a Variety of Stock

Total Page:16

File Type:pdf, Size:1020Kb

1 Why Pay Attention to Stock Message Boards? 2 a Variety of Stock Notes 1 Why Pay Attention to Stock Message Boards? 1. The term quality of life (QOL) references the general well-being of individu- als and societies. 2. www.stocktwits.com 3. http://www.empathica.com/retail2012 4. http://www.accenture.com/us-en/Pages/insight-shopper-preferences.aspx 5. For stocks priced under $1, add 0.5 percent of the principal value to the $7 commission. 6. http://www.sec.gov/News/PressRelease/Detail/PressRelease/1365171513574. Regulation Fair Disclosure is a regulation that was promulgated by the SEC in August 2000. The rule mandates that all publicly traded companies must disclose material information to all investors at the same time. 7. http://www.sec.gov/news/press/2000-135.txt 8. http://www.ftc.gov/opa/2007/06/wholefoods.shtm 9. Penny stocks are usually unlisted, highly speculative, and usually selling for a dollar or less. 10. Asset liquidity refers to how quickly an asset can be converted into cash without a significant loss in value. 11. For the definition of short selling, please visit http://www.sec.gov/answers/ shortsale.htm. Not all the stocks can be shorted. In order to sell the stock short, you must have margin privileges on your brokerage account. Your broker must have available shares to lend to you. You cannot sell short a stock which is under $5. At any time, you must maintain enough capital in your account to place a buy to cover order on your short position to return shares you borrowed to your broker. 2 A Variety of Stock Message Boards 1. Day traders, as defined by the SEC, “rapidly buy and sell stocks through- out the day in the hope that their stocks will continue climbing or falling in value for the seconds to minutes they own the stock, allowing them to lock in quick profits.” Please see http://www.sec.gov/answers/daytrading.htm 268 Notes 2. Trade types include buy, sell, buy to cover, and sell short. Order types include market, limit, stop, stop limit, and trailing stop. 3. Based on Fama and French (1995) and Carhart (1997) research, there are four widely recognized systematic risk factors: stock market premium fac- tor (Mktrf), size factor (SMB), style factor (HML), and momentum factor (UMD). 4. http://www.sec.gov/news/press/2011/2011-268.htm 5. http://www.sec.gov/news/headlines/intmm.htm 6. The ClearStation, a semiprivate chat room, is now an E*TRADE commu- nity. Chatters need to be E*TRADE customer to use the chat room. 7. http://www.sec.gov/news/headlines/tokyojoe2.htm 8. http://www.sec.gov/news/headlines/intmm.htm 9. The “Short” sentiment suggests short selling while the “Scalp” sentiment implies buying and selling quickly with the intent of a day trade profit but without any specific sentiment. 10. Poster’s credit score is a proxy for the poster’s reputation. Such reputation systems have been adopted in a wide range of online applications, includ- ing auction sites such as eBay.com, reseller sites such as Amazon.com and file sharing sites such as YouTube and Flickr. For a detailed explanation of the validity and effectiveness of an online reputation system, see Ghose, Ipeirotis, and Sundararajan (2006). 11. http://hotcopper.com.au/posts.asp?fid=303 12. On Yahoo! Finance, if a message board for a company you are interested in does not exist, you can create one. 13. http://www.siliconinvestor.com/subject.aspx?subjectid=6136 14. One can also sort the board by messages so that all messages will be listed reverse chronologically based on posting time. Sorting by topics is the default setting. 15. Big-Boards, which tracked the most active message boards and forums on the Web, used to be a popular Board of Boards, but it is no longer in service. 3 About Stock Message Board Posters 1. An ISP is a business or organization that offers user access to the Internet and related services, such as Comcast, Verizon, AOL, etc. 2. For example, http://www.thelion.com/bin/disclaimer.cgi and http://raging- bull.com/about 3. To further understand stock “pump-and-dump” manipulation, visit http:// www.sec.gov/investor/pubs/pump.htm 4. In July 1998, the SEC formed the Office of Internet Enforcement, a unit cre- ated to eliminate securities fraud occurring over the Internet. 5. Here is an example of companies and individuals who were charged with security fraud: http://www.sec.gov/news/headlines/intmm.htm 6. https://bulk.resource.org/courts.gov/c/F3/318/318.F3d.465.01-1120.html 7. http://www.thelion.com/bin/disclaimer.cgi 8. http://www.sec.gov/news/digest/1996/dig112096.pdf Notes 269 9. www.sec.gov/news/press/2000-135.txt and www.ftc.gov/opa/2007/06/ wholefoods.shtm 10. http://www.sec.gov/litigation/litreleases/lr15953.txt 11. https://www.sec.gov/litigation/admin/33-7885.htm 12. http://www.sec.gov/litigation/admin/3-9768.txt 13. http://www.sec.gov/litigation/litreleases/lr15855.txt 14. https://www.sec.gov/news/digest/dig090903.txt 15. www.hitwise.com and www.claritas.com 16. In finance, a long position in a security means the holder of the position owns (bought) the security and will profit if the price of the security goes up. In contrast, a short position means that the holder of the position does not own but borrowed the security from the broker and will profit if the price of the security goes down. Going long is the more conventional practice of investing and is contrasted with going short. 17. In the literature, when constructing a sentiment index, a common practice is to assign +2 to “Strong Buy,” +1 to “Buy,” 0 to “Hold” or “Scalp,” –1 to “Sell,” –2 to “Strong Sell,” and –3 to “Short.” 4 Why Do People Post Messages on Stock Message Boards? 1. In the United States, the SEC defines a penny stock as a security that trades below $5 per share, is not listed on a national exchange, and fails to meet other specific criteria. See http://www.sec.gov/answers/penny.htm 5 Modeling the Value of a Stock Message Board 1. A text classifier uses its algorithm to assign a sentiment score to a non-self- disclosed message. Details of text classifiers will be discussed in the next chapter. 2. SPY is SPDR S&P 500 Exchanged Traded Fund while DIA is SPDR Dow Jones Industrial Average Exchanged Traded Fund. 3. Many message boards provide a sentiment indicator for posters to explicitly disclose their sentiment on a voluntary basis. For instance, Yahoo! Finance allows a poster to choose one of the following sentiments: Strong Buy, Buy, Hold, Sell, Strong Sell, or not disclose (by default). TheLion.com offers two more sentiments: Short and Scalp. The Short sentiment suggests short selling while the Scalp sentiment implies buying and selling quickly with the intent of a day-trade profit but without any specific sentiment. Raging Bull also requires the author to specify his or her short-term and long-term sentiments (Tumarkin and Whitelaw 2001). 4. To be more realistic, the value of a message could also depend on the author’s reputation or credibility. Logically, a high-ranked author’s message should contribute a higher value than that of a low-ranked author’s, ceteris paribus. 270 Notes For example, when two social media accounts give out the same information and same recommendation, a higher value should be given to an Associated Press account than an unknown individual’s account. Another example is, one author’s credit score is 100 and the other author has just 1 credit score. If both of them are correct, we can define the value of a message’s value to be 100(+1) = +100 for the former author and 1(+1) = +1 for the latter one. But to keep my derivation simple, I define the value is +1 unit if the message’s sentiment is right and –1 unit otherwise. 5. Shorting is the practice of selling securities or other financial instruments that are not currently owned, with the intention of subsequently repurchas- ing them (“Buy to cover”) at a lower price. 6. In reality, it is possible for an author to have one or more messages deleted by the administrator. However, it is unlikely for one author’s messages to be consistently deleted over a considerable amount of time by the administrator because this type of author would be blocked by the message board. Thus, we do not consider a case with a negative bA in the model. 7. A reputation-recognition mechanism could be implemented either manda- torily by the forum design or voluntarily by all participants in the message board. An effective voluntary reputation-recognition mechanism would occur when an author makes a correct forecast and other participants grant him or her positive credit, and vice versa. 8. Rank’s upper limit 1/|2aF| can be set by the forum developer. Each author’s rank needs to be updated in real time based on his/her cumulative forecast accuracy. 9. http://www.sec.gov/news/headlines/intmm.htm and http://www.sec.gov/ news/headlines/tokyojoe2.htm 6 How to Measure Stock Message Boards’ Activities? 1. Although the Yahoo! Finance message board currently allows posters to dis- close their sentiments, it did not launch its self-disclosed sentiment function until June 2001. Thus, several earlier studies faced data restrictions. For example, studies using year 2000 messages from Yahoo! Finance as their data sample required researchers to hand-classify (hand-code) messages to obtain the training data set.
Recommended publications
  • Social Sentiment Indices Powered by X-Scores
    Social Sentiment Indices powered by X-Scores Brian Davis∗, Keith Cortisy, Laurentiu Vasiliuz, Adamantios Koumpisy, Ross McDermott∗ and Siegfried Handschuhy ∗INSIGHT Centre for Data Analytics, NUI Galway, Ireland Email: [email protected] yUniversitat¨ Passau, Passau, Germany Email: [email protected] zPeracton, Dublin, Ireland Email: [email protected] Abstract—Social Sentiment Indices powered by X-Scores (SSIX) can be incorporated into current operating models as additional seeks to address the challenge of extracting relevant and valuable attributes for executing investment decision-making, with a economic signals in a cross-lingual fashion from the vast variety goal to increase alpha and manage risk for a portfolio. of and increasingly influential social media services; such as Twitter, Google+, Facebook, StockTwits and LinkedIn, and in The European research project Social Sentiment Indices 2 conjunction with the most reliable and authoritative newswires, powered by X-Scores (SSIX) , seeks to assist in this chal- online newspapers, financial news networks, trade publications lenge of incorporating relevant and valuable social media and blogs. A statistical framework of qualitative and quantitative sentiment data into investment decision making by enabling parameters called X-Scores will power SSIX. This framework X-Scores metrics and SSIX indices to act as valid indicators will interpret economically significant sentiment signals that that will help produce increased growth for European Small are disseminated in the social ecosystem. Using X-Scores, SSIX and Medium-sized Enterprises (SMEs). SSIX will extract will create commercially viable and exploitable social sentiment meaningful financial signals in a cross-lingual fashion from a indices, regardless of language, locale and data format.
    [Show full text]
  • Social Sentiment Indices Powered by X-Scores
    ALLDATA 2016 : The Second International Conference on Big Data, Small Data, Linked Data and Open Data (includes KESA 2016) Social Sentiment Indices Powered by X-Scores Brian Davis∗, Keith Cortisy, Laurentiu Vasiliuz, Adamantios Koumpisy, Ross McDermott∗ and Siegfried Handschuhy ∗INSIGHT Centre for Data Analytics, NUI Galway, Ireland Email: [email protected] yUniversitat¨ Passau, Passau, Germany Email: [email protected] zPeracton, Dublin, Ireland Email: [email protected] Abstract—Social Sentiment Indices powered by X-Scores (SSIX) assist in this challenge of incorporating relevant and valuable seeks to address the challenge of extracting relevant and valuable social media sentiment data into investment decision making financial signals in a cross-lingual fashion from the vast variety by enabling X-Scores metrics and SSIX indices to act as of and increasingly influential social media services, such as valid indicators that will help produce increased growth for Twitter, Google+, Facebook, StockTwits and LinkedIn, and in European Small and Medium-sized Enterprises (SMEs). X- conjunction with the most reliable and authoritative newswires, Scores provide actionable analytics in the shape of unique online newspapers, financial news networks, trade publications and blogs. A statistical framework of qualitative and quantitative metrics calculated out of the Natural Language Processing parameters called X-Scores will power SSIX. This framework (NLP) output. SSIX will extract meaningful financial signals will interpret financially significant sentiment signals that are in a cross-lingual fashion from a multitude of social net- disseminated in the social ecosystem. Using X-Scores, SSIX work sources, such as Twitter, Google+, Facebook, StockTwits will create commercially viable and exploitable social sentiment and LinkedIn, and also authoritative news sources, such as indices, regardless of language, locale and data format.
    [Show full text]
  • SSIX Big Data Technologies and Methods for Leveraging Social Sentiment Data in Multiple Business Domains
    February 2017 SSIX Big Data Technologies and Methods for Leveraging Social Sentiment Data in Multiple Business Domains Keith Cortis a, Waqas Khawaja b, Ross McDermott b, Laurentiu Vasiliu c, Adamantios Koumpis a, Siegfried Handschuh a and Brian Davis b a Universitat¨ Passau, Passau, Germany b INSIGHT Centre for Data Analytics, NUI Galway, Ireland c Peracton, Dublin, Ireland Abstract. Social Sentiment Indices powered by X-Scores (SSIX) aims to provide European SMEs with a collection of easy to interpret tools to analyse and under- stand social media users’ attitudes for any given topic. These sentiment character- istics can be exploited to help SMEs operate more efficiently resulting in increased revenues. Social media data represents a combined measure of thoughts and views touching every area of life. SSIX will search and index conversations taking place on social network services, such as Twitter, StockTwits and Facebook, together with the most reliable and trustworthy news agencies, newspapers, blogs and in- dustry publications. A statistical framework of qualitative and quantitative param- eters called X-Scores will power SSIX. Classification and scoring of content will be done using this framework, regardless of language, locale or data architecture. The X-Scores framework will interpret economically significant sentiment signals in social media conversations producing sentiment metrics, such as momentum, breadth, topic frequency, volatility and historical comparison. These metrics will create commercially viable social sentiment indexes, which can be tailored to any domain of interest. By enabling European SMEs to analyse and leverage social sen- timent in their discipline, SSIX will facilitate the creation of innovative products and services by enhancing the investment decision making process, thus assisting in generating increased revenue while also minimising risk exposure.
    [Show full text]
  • Aggregate Attention∗
    Aggregate attention Paul J. Irvine Neeley School of Business Texas Christian University June, 2021 Abstract We study some of the aggregate properties of investor attention to the stock mar- ket. We introduce a framework that constructs a null hypothesis of what rational aggregate attention should look like. This framework states that aggregate attention should be proportional to the aggregate wealth invested in each stock. We find that the distribution of attention at first appears rational. However, much of this attention is directed away from the high market cap names that should attract the most attention. Attention is notably more volatile from month to month than market cap, making attention unpredictable from a market maker perspective This pattern generates pric- ing errors that result in the most popular stocks performing poorly in the upcoming month. This poor performance appears to be a short-term reversal of pricing errors generated by the unpredictable liquidity demands volatile attention can generate. All errors are our own. Preliminary and incomplete. Thanks to David Rakowski, Joey Engelberg, Siyi Shen, and seminar participants at the University of South Carolina. Corresponding author: [email protected]. Aggregate Attention Abstract We study some of the aggregate properties of investor attention to the stock market. We introduce a framework that constructs a null hypothesis of what rational aggregate attention should look like. This framework states that aggregate attention should be proportional to the aggregate wealth invested in each stock. We find that the distribution of attention at first appears rational. However, much of this attention is directed away from the high market cap names that should attract the most attention.
    [Show full text]
  • Thinktwenty20 Winter 2020 the Magazine for Financial Professionals
    Issue No.7 ThinkTWENTY20 Winter 2020 The Magazine for Financial Professionals Automated Audits - Distinguishing Myth From Reality How Social Media Became the Go-To Communication Channels Social Media - The New Reporting Frontier The Surprising Power of Experimentation Twenty-Twenty Vision Number 7, Winter 2020 Editor in Chief: Gerald Trites Managing Editor: Gundi Jeffrey Contributing Editor: Eric E Cohen Email: [email protected] Telephone: (416) 602-3931 Subscription rate, digital edition: $29.95 per year, $44.95 for two years. Individual digital issues: $9.95. To subscribe or buy an issue, go to our online store at https://thinktwenty20-magazine.myshopify.com. ISSN 2563-0113 Cover Photo by Ekaterina Bolovtsova from Pexels Submissions for the magazine are invited from people with an in-depth knowledge of accounting or finance. Submissions can be made by email attachment to [email protected]. Articles should be in Microsoft Word in 1 2 pt Calibri Font. They should be 2000 to 3000 words and be well researched as evidenced by the inclusion of references, which should be numbered and included at the end of the article. Bibliographies are also encouraged. Academic papers with extensive mathematical analyses will not be accepted. We ask for first publication rights, after which copyright returns to the author. Founding Partner TABLE OF CONTENTS Editorial……………………………………………………………………………………………………………………...P. 1 Distinguishing Hype from Reality about the Future of Automated Audits..…P. 3 By Gregory P. Shields, CPA, CA Auditors have been put on notice: artificial intelligence (AI) is here and it’s here to stay. The possibility of AI-enabled machines replacing human auditors is creating some angst.
    [Show full text]
  • Predicting Mental Health from Followed Accounts Predicting Ment
    Predicting Mental Health from Followed Accounts 1 Running Head: Predicting Mental Health from Followed Accounts Predicting Mental Health from Followed Accounts on Twitter Cory K. Costello, Sanjay Srivastava, Reza Rejaie, & Maureen Zalewski University of Oregon Author note: Cory K. Costello, Sanjay Srivastava, and Maureen Zalewski, University of Oregon, Department of Psychology, 1227 University of Oregon, Eugene, OR, 97403. Reza Rejaie, University of Oregon, Department of Computer and Information Science, 1202 University of Oregon, Eugene, OR, 97403. Correspondence concerning this article should be addressed to Cory K. Costello, who is now at the Department of Psychology, University of Michigan, 530 Church Street, University of Michigan, Ann Arbor, MI 48109, email: [email protected] Study materials, analysis code, and registrations (including pre-registered Stage 1 protocol) can be found at the project site: https://osf.io/54qdm/ This material is based on work supported by the National Institute of Mental Health under Grant # 1 R21 MH106879-01 and the National Science Foundation under NSF GRANT# 1551817. Predicting Mental Health from Followed Accounts 2 Abstract The past decade has seen rapid growth in research linking stable psychological characteristics (i.e., traits) to digital records of online behavior in Online Social Networks (OSNs) like Facebook and Twitter, which has implications for basic and applied behavioral sciences. Findings indicate that a broad range of psychological characteristics can be predicted from various behavioral residue online, including language used in posts on Facebook (Park et al., 2015) and Twitter (Reece et al., 2017), and which pages a person ‘likes’ on Facebook (e.g., Kosinski, Stillwell, & Graepel, 2013).
    [Show full text]
  • Three Essays in Financial Economics
    THREE ESSAYS IN FINANCIAL ECONOMICS A Dissertation Presented to the Faculty of the Graduate School of Cornell University in Partial Fulfillment of the Requirements for the Degree of Doctor of Philosophy by Alan Paul Kwan May 2017 Copyright c 2017 Alan Paul Kwan ALL RIGHTS RESERVED THREE ESSAYS IN FINANCIAL ECONOMICS Alan Paul Kwan, PhD Cornell University 2017 This dissertation explores three different perspectives on frictions that impact the functioning of financial markets. My first and third essay explore information economics in financial mar- kets. My second essay studies the role of regulatory scope and how financial regulation should be implemented. In Chapter 1, “Does Social Media Cause Excess Comovement?”, I study social media’s potential to impact financial markets. When information is costly to produce, information intermediaries specialize in some stocks, creating flows of information and trading among such stocks. Trading by customers results in “excess”, seemingly non-fundamental comovement. Consistent with this theory, I find that co-mentioning of stocks explains increases in comovement. Three different empirical designs point toward a causal interpretation. In Chapter 2 (joint with Chicago Booth PhD students Ben Charoenwong and Tarik Umar), “Who Should Regulate Investment Advisors?”, we study whether national or local regulators best de- ter investment adviser misconduct. Dodd-Frank provides us a laboratory to observe a large re- jurisdiction event in which state regulators below an arbitrary threshold were delegated to state regulation. Consistent with weakened regulation, customer complaint rates increase. The com- plaints represent more severe, not more frivolous reporting. Finally, they precipitate for firms and adviser representatives it might be assumed under the weakest oversight, such as those further from regulators.
    [Show full text]
  • Marketwatch Virtual Stock Exchange
    Analysis: MarketWatch Virtual Stock Exchange 1. EXECUTIVE SUMMARY Key takeaways: ● VSE has been around longer than most competitors ● Heavily supported by MarketWatch and Dow Jones network ○ Dow Jones (especially WSJ) has excellent advertising capabilities ○ MarketWatch provides high-quality news feed content ○ MarketWatch VSE game is “lumped into” MarketWatch generally for advertising purposes ■ VSE probably receives very little standalone attention from corporate owners (or even advertisers, who are only choosing MarketWatch “generally” for advertising purposes) ○ Dow Jones Media Group was formed in 2016 to focus more/better on digital advertising and millennial demographic ● VSE banner advertising brings in revenue, but game does not ○ Opportunity to position as “loss leader”? ● Website traffic ○ Mostly US (not surprising since game trades only US stocks) ○ Page and domain referrals largely come from within Dow Jones network ■ VSE experienced a big jump in referring domains and pages in 2016, which is the same year that Dow Jones Media Group was formed ○ Some game pages get (or got, while they were still active) far more traffic than the main VSE page ○ Note: I’m not a website traffic expert, so perhaps your programmers will infer other takeaways from the website traffic data that I missed ● General user sentiment seems to be that VSE is best for learning or academic settings, but not super fun as a “game” ○ More engaging in group settings (for bragging rights) but not very engaging to use as a single individual ● Recurring user complaint
    [Show full text]
  • NIMO SM How-To Manual
    National Incident Management Organization Social Media How-To Manual for Incidents DRAFT 3/10/10 Version 1b Disclaimer: This document should not be construed as direction or permission to use social media on Federal wildland fires. It is simply a “primer” on how to use a variety of tools for those who want to build skills. Decisions about social media use are between each individual and their agency. 3/10/10 Version 1b NIMO Social Media How-To Manual Table of Contents Listening Tools Blog Search Page 2 Google Reader Page 3 Google News Page 4 Google Alerts Page 6 Addictomatic Page 8 Technorati Page 9 IceRocket Page 10 Twitter Search Page 11 Monitter Page 12 Twitter How to create an account Page 14 Posting a Twitter update Page 16 Twitter-The Lingo Page 17 Twitter-on-the-fly Page 19 Twitter Lists Page 22 Consider a Tweetup Page 26 Twittlonger Page 27 TwitDoc Page 29 Management Tools TweetDeck Page 32 Hootsuite Page 47 Facebook What is a Facebook Page Page 49 Administering a Page Page 51 Some Facebook Best Practices Page 54 Tweeting from your Facebook Page Page 56 Embedding links on Facebook Page 57 Facebook Insights Page 58 Import-Inciweb-Twitter-Feeds Page 60 Photo Management Tools Picasa Page 64 Flickr Page 73 Analytics Tools for Measuring Success Page 78 Videos My Fire Videos How-To Page 80 YouTube Page 81 Blogger Creating an Incident Blog Page 8 3/10/10 Version 1b Listening Tools Google Blog Search Blog Search is Google search technology focused on blogs. Your results include all blogs, not just those published through Blogger; our blog index is continually updated, so you'll always get the most accurate and up-to-date results.
    [Show full text]
  • Howard Lindzon - Transcript
    Howard Lindzon - Transcript Jim: 00:01:49 Well, Hello everyone. It's Jim O'Shaughnessy with my colleague Jamie Catherwood, one of my favorite people. And we've got to actually tell how we met Howard, because I think it's hysterical. Howard Lindzon, social media [inaudible 00:02:08]. CEO, chairman of the board. I mean, my God, you are... Howard Lindzon is social media. Social media is Howard Lindzon. How are you? Howard: 00:02:18 I am just a little bit crappy, but just, I'm excited to be here. Two old guys talking about stuff we barely know [crosstalk 00:02:28] Jim: 00:02:46 So I'm going to tell the story and then you can fill in the color. So I met Howard here in New York at an office open house. And it was our pals at Ritholtz Wealth Management. And they got actually a very nice office for I think a pretty reasonable- Jim: 00:03:07 Nice. Right next to the library. If you know New York, it's really a great location. And so I went up, Patrick was supposed to come, but as usual, I may be 60, but I act like I'm 10, whereas Patrick is 36 and acts like he's much more mature than I am. Anyway, so he had something else to do. And I went and I met you. And you had this vest on. Do you remember? Howard: 00:03:33 I had a three piece suit. I have no idea why.
    [Show full text]
  • Text Mining of Stocktwits Data for Predicting Stock Prices
    Article Text Mining of Stocktwits Data for Predicting Stock Prices Mukul Jaggi * , Priyanka Mandal , Shreya Narang , Usman Naseem and Matloob Khushi * School of Computer Science, The University of Sydney, Sydney, NSW 2006, Australia; [email protected] (P.M.); [email protected] (S.N.); [email protected] (U.N.) * Correspondence: [email protected] (M.J.); [email protected] (M.K.) Abstract: Stock price prediction can be made more efficient by considering the price fluctuations and understanding people’s sentiments. A limited number of models understand financial jargon or have labelled datasets concerning stock price change. To overcome this challenge, we introduced FinALBERT, an ALBERT based model trained to handle financial domain text classification tasks by labelling Stocktwits text data based on stock price change. We collected Stocktwits data for over ten years for 25 different companies, including the major five FAANG (Facebook, Amazon, Apple, Netflix, Google). These datasets were labelled with three labelling techniques based on stock price changes. Our proposed model FinALBERT is fine-tuned with these labels to achieve optimal results. We experimented with the labelled dataset by training it on traditional machine learning, BERT, and FinBERT models, which helped us understand how these labels behaved with different model architectures. Our labelling method’s competitive advantage is that it can help analyse the historical data effectively, and the mathematical function can be easily customised to predict stock movement. Keywords: BERT; FinBERT; ALBERT; NLP; StockTwits; FinALBERT; FAANG; transformer; pre- training; fine-tuning Citation: Jaggi, M.; Mandal, P.; Narang, S.; Naseem, U.; Khushi, M.
    [Show full text]
  • The Predictive Power of Stock Micro- Blogging Sentiment in Forecasting Stock Market Behaviour
    THE PREDICTIVE POWER OF STOCK MICRO- BLOGGING SENTIMENT IN FORECASTING STOCK MARKET BEHAVIOUR This thesis is submitted for the degree of Doctor of Philosophy By Alya Ali AL-Nasseri Brunel Business School Brunel University, London February 2016 ABSTRACT Online stock forums have become a vital investing platform on which to publish relevant and valuable user-generated content (UGC) data such as investment recommendations and other stock-related information that allow investors to view the opinions of a large number of users and share-trading ideas. This thesis applies methods from computational linguistics and text-mining techniques to analyse and extract, on a daily basis, sentiments from stock-related micro-blogging messages called “StockTwits”. The primary aim of this research is to provide an understanding of the predictive ability of stock micro-blogging sentiments to forecast future stock price behavioural movements by investigating the various roles played by investor sentiments in determining asset pricing on the stock market. The empirical analysis in this thesis consists of four main parts based on the predictive power and the role of investor sentiment in the stock market. The first part discusses the findings of the text-mining procedure for extracting and predicting sentiments from stock-related micro-blogging data. The purpose is to provide a comparative textual analysis of different machine learning algorithms for the purpose of selecting the most accurate text-mining techniques for predicting sentiment analysis on StockTwits through the provision of two different applications of feature selection, namely filter and wrapper approaches. The second part of the analysis focuses on investigating the predictive correlations between StockTwits features and the stock market indicators.
    [Show full text]