Stock Market Prediction System using Machine Learning Algorithm

Project period

02/02/2020 - 03/02/2020


A country's economy is closely tied to its stock market, and the market is often read as an indicator of the country's growth. Accurate stock market predictions can help and guide investors toward better returns. Predictions are made by carefully gathering and analyzing historical stock market data, here data from previous years. Stock prices change every minute and produce a large volume of data. The market also carries considerable risk, because gains and losses are hard to anticipate. Predicting stock prices is therefore not easy; it requires close analysis of the behavior of stock market time series. Stock market prediction plays a vital role in estimating the future value of a company's shares.

Machine learning is a branch of Artificial Intelligence in which a system learns from data to make predictions or decisions without being explicitly programmed for each case. The aim of machine learning is to capture the structure of the data and fit it into models that researchers can use. This project analyzes several machine learning algorithms applied to stock market prediction. With machine learning, one can aim to predict a market value close to the real value. Machine learning in stock market prediction has attracted many investors and researchers because of its potential for accurate estimation.

Why: Problem statement

When a company's shares are performing at their peak, the demand for jobs is high and money flows between people, forming a chain of mutual dependency. This project helps the user predict future stock prices and decide which company is feasible to invest in.

How: Solution description

Data collection:

Three years of stock data were collected from different internet sources. The factors considered are open, close, low, high and volume.

The aim is to build a predictive model that estimates the stock value for the coming year using LSTM.
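
As a rough sketch of how a price series can be prepared for such a model, the snippet below rescales prices to the range 0 to 1 and cuts them into fixed-length windows, each paired with the value that follows it. The notebook does this with MinMaxScaler and a 60-day window; the helper names min_max_scale and make_windows, the made-up prices and the window of 3 are assumptions used only for illustration.

```python
def min_max_scale(values):
    """Rescale values linearly to the range [0, 1]."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


def make_windows(series, window):
    """Return (inputs, targets): each input is `window` consecutive values
    and its target is the value that immediately follows them."""
    inputs, targets = [], []
    for i in range(window, len(series)):
        inputs.append(series[i - window:i])
        targets.append(series[i])
    return inputs, targets


prices = [10.0, 12.0, 11.0, 13.0, 14.0, 15.0]  # made-up prices, not real data
scaled = min_max_scale(prices)
x_train, y_train = make_windows(scaled, window=3)
print(len(x_train), "training samples of length", len(x_train[0]))
```

With a window of 60, as in the notebook, the first 60 rows only serve as history, so training samples start at row 60.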

Long Short-Term Memory networks (LSTMs) are a special type of RNN with the ability to learn long-term dependencies. They work well on a wide variety of problems and are now widely used. LSTMs are explicitly designed to avoid the long-term dependency problem. Retaining information for a prolonged period of time is practically their default behaviour, not something they struggle to learn.
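
To make the idea of remembering information concrete, here is a minimal sketch of one time step of an LSTM cell with a single hidden unit, written in plain Python. It is not the Keras implementation used in the notebook; the function name lstm_step and the weights are hypothetical and were picked by hand to show a cell state that is carried almost unchanged across steps.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def lstm_step(x, h_prev, c_prev, p):
    """One time step of a scalar LSTM cell (hidden size 1).

    p maps each gate name to a (w_x, w_h, bias) tuple.
    """
    def pre(name):
        w_x, w_h, b = p[name]
        return w_x * x + w_h * h_prev + b

    f = sigmoid(pre("forget"))        # how much of the old cell state to keep
    i = sigmoid(pre("input"))         # how much new information to write
    g = math.tanh(pre("candidate"))   # candidate value to write
    o = sigmoid(pre("output"))        # how much of the cell state to expose
    c = f * c_prev + i * g            # updated cell state (long-term memory)
    h = o * math.tanh(c)              # updated hidden state (output)
    return h, c


# Hypothetical weights chosen only for illustration, not trained values.
params = {
    "forget": (0.0, 0.0, 10.0),     # large bias: forget gate close to 1
    "input": (0.0, 0.0, -10.0),     # large negative bias: input gate close to 0
    "candidate": (1.0, 0.0, 0.0),
    "output": (0.0, 0.0, 0.0),
}

h, c = 0.0, 1.0
for x in [0.5, -0.3, 0.8]:
    h, c = lstm_step(x, h, c, params)
print("cell state after three steps:", c)
```

Because the forget gate stays near 1 and the input gate near 0, the cell state stays close to its starting value of 1.0 whatever the inputs are. In a trained network the gates are learned from the data.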

By employing machine learning algorithms, we classified the value of the stock market.

The proposed system uses three algorithms, namely:

K-Nearest Neighbour

K-nearest neighbours (KNN) is a simple algorithm that stores all available cases and classifies new cases based on a similarity measure (e.g., distance functions). KNN has been widely used in statistical estimation and pattern recognition. It is a non-parametric technique.
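
The following is a minimal sketch of KNN classification with Euclidean distance and majority voting, written in plain Python. The function name knn_predict, the two features and the sample values are made up for illustration and are not taken from the project data.

```python
import math
from collections import Counter


def knn_predict(train_points, train_labels, query, k=3):
    """Classify `query` by majority vote among its k nearest training points."""
    distances = []
    for point, label in zip(train_points, train_labels):
        # Euclidean distance between the stored case and the new case
        d = math.sqrt(sum((a - b) ** 2 for a, b in zip(point, query)))
        distances.append((d, label))
    distances.sort(key=lambda pair: pair[0])
    nearest_labels = [label for _, label in distances[:k]]
    return Counter(nearest_labels).most_common(1)[0][0]


# Made-up samples: (daily change in %, relative volume) -> next-day trend
points = [(1.2, 0.9), (0.8, 1.1), (1.0, 1.0), (-1.1, 1.3), (-0.9, 1.2), (-1.3, 0.8)]
labels = ["up", "up", "up", "down", "down", "down"]

prediction = knn_predict(points, labels, query=(0.9, 1.0), k=3)
print(prediction)
```

No model is fitted in advance: the stored cases are the model, which is why KNN is called non-parametric.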

Decision Tree Algorithm

The Decision Tree algorithm belongs to the family of supervised learning algorithms. A decision tree builds classification or regression models in the form of a tree structure, so it can be used for both kinds of problems. The goal of using a Decision Tree is to create a model that predicts the class or value of the target variable by learning simple decision rules inferred from prior data (training data). The topmost decision node, which corresponds to the best predictor, is called the root node. Decision trees can handle both categorical and numerical data.
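
As an illustration of how a decision rule for the root node can be learned, the sketch below searches every feature and threshold for the split with the lowest weighted Gini impurity. Gini impurity is one common splitting criterion and is an assumption here, since the text does not say which criterion the project used; the names gini and best_split and the sample data are likewise hypothetical.

```python
def gini(labels):
    """Gini impurity of a list of class labels (0.0 means a pure group)."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((labels.count(c) / n) ** 2 for c in set(labels))


def best_split(rows, labels):
    """Return (impurity, feature index, threshold) of the best single split."""
    best = None
    n = len(rows)
    for feature in range(len(rows[0])):
        for threshold in sorted({row[feature] for row in rows}):
            left = [lab for row, lab in zip(rows, labels) if row[feature] <= threshold]
            right = [lab for row, lab in zip(rows, labels) if row[feature] > threshold]
            if not left or not right:
                continue  # a split must send samples to both sides
            impurity = (len(left) * gini(left) + len(right) * gini(right)) / n
            if best is None or impurity < best[0]:
                best = (impurity, feature, threshold)
    return best


# Made-up samples: (daily change in %, relative volume) -> next-day trend
rows = [(1.2, 0.9), (0.8, 1.1), (1.0, 1.0), (-1.1, 1.3), (-0.9, 1.2), (-1.3, 0.8)]
labels = ["up", "up", "up", "down", "down", "down"]

impurity, feature, threshold = best_split(rows, labels)
print("root rule: feature", feature, "<=", threshold)
```

A full tree repeats the same search on each side of the split until the groups are pure or a depth limit is reached.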

Random Forest

Random forest is a supervised learning algorithm that is used for both classification and regression. It is mainly used for classification problems. A forest is made up of trees, and more trees make a more robust forest. Similarly, the random forest algorithm builds decision trees on data samples, gets a prediction from each of them, and finally selects the best solution by means of voting.
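
The sampling-and-voting idea can be sketched in plain Python as follows. To keep the example short, every tree is reduced to a single split (a stump) trained on a bootstrap sample and one randomly chosen feature. This is a simplification of a real random forest, and all names and data below are made up for illustration.

```python
import random
from collections import Counter


def gini(labels):
    n = len(labels)
    return 1.0 - sum((labels.count(c) / n) ** 2 for c in set(labels))


def train_stump(rows, labels, feature):
    """Fit a one-split tree on a single feature; return a predict function."""
    majority = Counter(labels).most_common(1)[0][0]
    best = None  # (impurity, threshold, left_label, right_label)
    for threshold in sorted({row[feature] for row in rows}):
        left = [lab for row, lab in zip(rows, labels) if row[feature] <= threshold]
        right = [lab for row, lab in zip(rows, labels) if row[feature] > threshold]
        if not left or not right:
            continue
        impurity = (len(left) * gini(left) + len(right) * gini(right)) / len(rows)
        if best is None or impurity < best[0]:
            best = (impurity, threshold,
                    Counter(left).most_common(1)[0][0],
                    Counter(right).most_common(1)[0][0])
    if best is None:  # no valid split: always predict the majority label
        return lambda row: majority
    _, threshold, left_label, right_label = best
    return lambda row: left_label if row[feature] <= threshold else right_label


def train_forest(rows, labels, n_trees=25, seed=0):
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    forest = []
    for _ in range(n_trees):
        # Bootstrap sample: draw len(rows) indices with replacement.
        idx = [rng.randrange(len(rows)) for _ in rows]
        sample_rows = [rows[i] for i in idx]
        sample_labels = [labels[i] for i in idx]
        feature = rng.randrange(len(rows[0]))  # random feature for this tree
        forest.append(train_stump(sample_rows, sample_labels, feature))
    return forest


def forest_predict(forest, row):
    """Majority vote over the predictions of all trees."""
    votes = Counter(tree(row) for tree in forest)
    return votes.most_common(1)[0][0]


# Made-up samples: (daily change in %, momentum score) -> next-day trend
rows = [(1.2, 0.9), (0.8, 0.7), (1.0, 0.8), (-1.1, 0.3), (-0.9, 0.2), (-1.3, 0.1)]
labels = ["up", "up", "up", "down", "down", "down"]

forest = train_forest(rows, labels)
print(forest_predict(forest, (1.1, 0.95)))
```

Individual trees can be wrong, for example when a bootstrap sample happens to contain only one class, but the vote across many trees evens out such errors.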

We deployed the trained machine learning model on the web through Flask.

How is it different from competition

The existing stock market price prediction system uses a decision tree to predict high and low trends. The stock market is dynamic in nature and prone to change with factors such as political uncertainty, money flow, natural calamities and the job market. Since a decision tree alone cannot account for this many factors, it cannot reliably predict the correct trend.

The existing system also has no user interface. In the proposed system, we used three algorithms along with an LSTM model to predict the value of the stock market, and we deployed the machine learning model on the web through Flask.

Who are your customers

Stock market investors and business people

Project Phases and Schedule

Phase 1: User data collection

Phase 2: Stock selection

Phase 3: Stock analysis

Phase 4: Stock recommendation


 

Resources Required

Anaconda tool - Python 3.7

Project Code
/* Your file Name : Stock_final.ipynb */
/* Your coding Language : python */
/* Your code snippet start here */
{
 "cells": [
  {
   "cell_type": "code",
   "execution_count": 223,
   "metadata": {},
   "outputs": [],
   "source": [
    "import numpy as np # linear algebra\n",
    "import pandas as pd # data processing, CSV file I/O (e.g. pd.read_csv)\n",
    "import matplotlib.pyplot as plt\n",
    "import os\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 224,
   "metadata": {},
   "outputs": [],
   "source": [
    "dataset_train = pd.read_csv(\"trainset.csv\" , index_col=0)\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 225,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/html": [
       "<div>\n",
       "<style scoped>\n",
       "    .dataframe tbody tr th:only-of-type {\n",
       "        vertical-align: middle;\n",
       "    }\n",
       "\n",
       "    .dataframe tbody tr th {\n",
       "        vertical-align: top;\n",
       "    }\n",
       "\n",
       "    .dataframe thead th {\n",
       "        text-align: right;\n",
       "    }\n",
       "</style>\n",
       "<table border=\"1\" class=\"dataframe\">\n",
       "  <thead>\n",
       "    <tr style=\"text-align: right;\">\n",
       "      <th></th>\n",
       "      <th>Open</th>\n",
       "      <th>High</th>\n",
       "      <th>Low</th>\n",
       "      <th>Close</th>\n",
       "      <th>Adj Close</th>\n",
       "      <th>Volume</th>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <th>Date</th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "      <th></th>\n",
       "    </tr>\n",
       "  </thead>\n",
       "  <tbody>\n",
       "    <tr>\n",
       "      <td>02-01-2013</td>\n",
       "      <td>357.385559</td>\n",
       "      <td>361.151062</td>\n",
       "      <td>355.959839</td>\n",
       "      <td>359.288177</td>\n",
       "      <td>359.288177</td>\n",
       "      <td>5115500</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>03-01-2013</td>\n",
       "      <td>360.122742</td>\n",
       "      <td>363.600128</td>\n",
       "      <td>358.031342</td>\n",
       "      <td>359.496826</td>\n",
       "      <td>359.496826</td>\n",
       "      <td>4666500</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>04-01-2013</td>\n",
       "      <td>362.313507</td>\n",
       "      <td>368.339294</td>\n",
       "      <td>361.488861</td>\n",
       "      <td>366.600616</td>\n",
       "      <td>366.600616</td>\n",
       "      <td>5562800</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>07-01-2013</td>\n",
       "      <td>365.348755</td>\n",
       "      <td>367.301056</td>\n",
       "      <td>362.929504</td>\n",
       "      <td>365.001007</td>\n",
       "      <td>365.001007</td>\n",
       "      <td>3332900</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>08-01-2013</td>\n",
       "      <td>365.393463</td>\n",
       "      <td>365.771027</td>\n",
       "      <td>359.874359</td>\n",
       "      <td>364.280701</td>\n",
       "      <td>364.280701</td>\n",
       "      <td>3373900</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "      <td>...</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>09-01-2015</td>\n",
       "      <td>501.997498</td>\n",
       "      <td>502.156616</td>\n",
       "      <td>492.082062</td>\n",
       "      <td>493.454498</td>\n",
       "      <td>493.454498</td>\n",
       "      <td>2069400</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>12-01-2015</td>\n",
       "      <td>492.231232</td>\n",
       "      <td>493.261566</td>\n",
       "      <td>484.891632</td>\n",
       "      <td>489.854309</td>\n",
       "      <td>489.854309</td>\n",
       "      <td>2322400</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>13-01-2015</td>\n",
       "      <td>496.109894</td>\n",
       "      <td>500.227234</td>\n",
       "      <td>489.695190</td>\n",
       "      <td>493.464447</td>\n",
       "      <td>493.464447</td>\n",
       "      <td>2370500</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>14-01-2015</td>\n",
       "      <td>491.942810</td>\n",
       "      <td>500.475861</td>\n",
       "      <td>490.301849</td>\n",
       "      <td>498.128784</td>\n",
       "      <td>498.128784</td>\n",
       "      <td>2235700</td>\n",
       "    </tr>\n",
       "    <tr>\n",
       "      <td>15-01-2015</td>\n",
       "      <td>502.803040</td>\n",
       "      <td>502.912445</td>\n",
       "      <td>495.035797</td>\n",
       "      <td>499.043732</td>\n",
       "      <td>499.043732</td>\n",
       "      <td>2715800</td>\n",
       "    </tr>\n",
       "  </tbody>\n",
       "</table>\n",
       "<p>514 rows × 6 columns</p>\n",
       "</div>"
      ],
      "text/plain": [
       "                  Open        High         Low       Close   Adj Close  \\\n",
       "Date                                                                     \n",
       "02-01-2013  357.385559  361.151062  355.959839  359.288177  359.288177   \n",
       "03-01-2013  360.122742  363.600128  358.031342  359.496826  359.496826   \n",
       "04-01-2013  362.313507  368.339294  361.488861  366.600616  366.600616   \n",
       "07-01-2013  365.348755  367.301056  362.929504  365.001007  365.001007   \n",
       "08-01-2013  365.393463  365.771027  359.874359  364.280701  364.280701   \n",
       "...                ...         ...         ...         ...         ...   \n",
       "09-01-2015  501.997498  502.156616  492.082062  493.454498  493.454498   \n",
       "12-01-2015  492.231232  493.261566  484.891632  489.854309  489.854309   \n",
       "13-01-2015  496.109894  500.227234  489.695190  493.464447  493.464447   \n",
       "14-01-2015  491.942810  500.475861  490.301849  498.128784  498.128784   \n",
       "15-01-2015  502.803040  502.912445  495.035797  499.043732  499.043732   \n",
       "\n",
       "             Volume  \n",
       "Date                 \n",
       "02-01-2013  5115500  \n",
       "03-01-2013  4666500  \n",
       "04-01-2013  5562800  \n",
       "07-01-2013  3332900  \n",
       "08-01-2013  3373900  \n",
       "...             ...  \n",
       "09-01-2015  2069400  \n",
       "12-01-2015  2322400  \n",
       "13-01-2015  2370500  \n",
       "14-01-2015  2235700  \n",
       "15-01-2015  2715800  \n",
       "\n",
       "[514 rows x 6 columns]"
      ]
     },
     "execution_count": 225,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "dataset_train.head(514)"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 226,
   "metadata": {
    "scrolled": true
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "array([[ 355.959839],\n",
       "       [ 358.031342],\n",
       "       [ 361.488861],\n",
       "       ...,\n",
       "       [1048.050049],\n",
       "       [1044.77002 ],\n",
       "       [1044.900024]])"
      ]
     },
     "execution_count": 226,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "#Select the third column (index 2, 'Low') as a 2-D array of shape (n, 1).\n",
    "#Use [:,0:1] for the first column ('Open') or [:,1:2] for the second ('High').\n",
    "trainset = dataset_train.iloc[:,2:3].values\n",
    "trainset"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 227,
   "metadata": {},
   "outputs": [],
   "source": [
    "#MinMaxScaler rescales each feature to feature_range, a (min, max) tuple, default (0, 1)\n",
    "from sklearn.preprocessing import MinMaxScaler\n",
    "sc = MinMaxScaler(feature_range = (0,1))\n",
    "training_scaled = sc.fit_transform(trainset)"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 228,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "array([[0.01454946],\n",
       "       [0.01743441],\n",
       "       [0.02224964],\n",
       "       ...,\n",
       "       [0.97841338],\n",
       "       [0.97384533],\n",
       "       [0.97402638]])"
      ]
     },
     "execution_count": 228,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "training_scaled"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 229,
   "metadata": {},
   "outputs": [],
   "source": [
    "x_train = []\n",
    "y_train = []\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 230,
   "metadata": {},
   "outputs": [],
   "source": [
    "#Build sliding windows: each sample holds the previous 60 scaled values,\n",
    "#and its target is the value that follows. Rows 60 to 514 give 455 samples.\n",
    "#The Python lists are converted to NumPy arrays after the loop.\n",
    "\n",
    "for i in range(60,515):\n",
    "    x_train.append(training_scaled[i-60:i, 0])\n",
    "    y_train.append(training_scaled[i,0])\n",
    "x_train,y_train = np.array(x_train),np.array(y_train)\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 231,
   "metadata": {},
   "outputs": [],
   "source": [
    "x_train = np.reshape(x_train, (x_train.shape[0],x_train.shape[1],1))\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 232,
   "metadata": {},
   "outputs": [],
   "source": [
    "#LSTM networks are well-suited to classifying, processing and making predictions based on time series data, \n",
    "#since there can be lags of unknown duration between important events in a time series.\n",
    "#Refer this link for LSTM : https://en.wikipedia.org/wiki/Long_short-term_memory\n",
    "from keras.models import Sequential\n",
    "from keras.layers import Dense\n",
    "from keras.layers import LSTM\n",
    "from keras.layers import Dropout"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 233,
   "metadata": {},
   "outputs": [],
   "source": [
    "regressor = Sequential()\n",
    "regressor.add(LSTM(units = 50,return_sequences = True,input_shape = (x_train.shape[1],1)))"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 234,
   "metadata": {},
   "outputs": [],
   "source": [
    "#Dropout is a regularization method: during training, each Dropout layer randomly sets\n",
    "#a fraction (here 20%) of the previous layer's outputs to zero, which reduces overfitting.\n",
    "#Overfitting means the model fits the training data too closely and may therefore\n",
    "#fail to fit additional data or predict future observations reliably.\n",
    "regressor.add(Dropout(0.2))\n",
    "regressor.add(LSTM(units = 50,return_sequences = True))\n",
    "regressor.add(Dropout(0.2))\n",
    "regressor.add(LSTM(units = 50,return_sequences = True))\n",
    "regressor.add(Dropout(0.2))\n",
    "regressor.add(LSTM(units = 50))\n",
    "regressor.add(Dropout(0.2))\n",
    "regressor.add(Dense(units = 1))\n",
    "regressor.compile(optimizer = 'adam',loss = 'mean_squared_error')\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 235,
   "metadata": {},
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Epoch 1/1\n",
      "455/455 [==============================] - 4s 8ms/step - loss: 0.0150\n"
     ]
    },
    {
     "data": {
      "text/plain": [
       "<keras.callbacks.callbacks.History at 0x11f29b917c8>"
      ]
     },
     "execution_count": 235,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "#Batch size - number of samples processed per gradient update (32 here)\n",
    "#Epochs - number of complete passes over the training data\n",
    "regressor.fit(x_train,y_train,epochs = 10,batch_size = 32)\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 14,
   "metadata": {},
   "outputs": [],
   "source": [
    "#Training the data has been completed - Only 60 to 514 values got trained"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {},
   "outputs": [],
   "source": [
    "#The loss is the mean squared error on scaled prices; 1 - loss is not an accuracy. Compare predictions with real prices (e.g. RMSE) instead."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 236,
   "metadata": {},
   "outputs": [],
   "source": [
    "#Loading the testing dataset\n",
    "dataset_test =pd.read_csv(\"testset.csv\", index_col=0)\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 247,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "Date\n",
       "02-01-2013    357.385559\n",
       "03-01-2013    360.122742\n",
       "04-01-2013    362.313507\n",
       "07-01-2013    365.348755\n",
       "08-01-2013    365.393463\n",
       "                 ...    \n",
       "12-11-2013    500.594116\n",
       "13-11-2013    500.122192\n",
       "14-11-2013    513.619385\n",
       "15-11-2013    514.091309\n",
       "18-11-2013    514.528503\n",
       "Name: Open, Length: 223, dtype: float64"
      ]
     },
     "execution_count": 247,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "#Actual prices from the test set (column index 1, the second column)\n",
    "real_stock_price = dataset_test.iloc[:,1:2].values\n",
    "dataset_total = pd.concat((dataset_train['Open'],dataset_test['Open']),axis = 0)\n",
    "dataset_total.head(223)"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 238,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "array([ 955.48999 ,  966.700012,  980.      ,  980.      ,  973.719971,\n",
       "        987.450012,  992.      ,  992.099976,  990.289978,  991.77002 ,\n",
       "        986.      ,  989.440002,  989.52002 ,  970.      ,  968.369995,\n",
       "        980.      , 1009.190002, 1014.      , 1015.219971, 1017.210022,\n",
       "       1021.76001 , 1022.109985, 1028.98999 , 1027.27002 , 1030.52002 ,\n",
       "       1033.98999 , 1026.459961, 1023.419983, 1022.590027, 1019.210022,\n",
       "       1022.52002 , 1034.01001 , 1020.26001 , 1023.309998, 1035.      ,\n",
       "       1035.869995, 1040.      , 1055.089966, 1042.680054, 1022.369995,\n",
       "       1015.799988, 1012.659973,  995.940002, 1001.5     , 1020.429993,\n",
       "       1037.48999 , 1035.5     , 1039.630005, 1046.119995, 1045.      ,\n",
       "       1054.609985, 1066.079956, 1075.199951, 1071.780029, 1064.949951,\n",
       "       1061.109985, 1058.069946, 1057.390015, 1051.599976, 1046.719971,\n",
       "       1048.339966, 1064.310059, 1088.      , 1094.      , 1102.22998 ,\n",
       "       1109.400024, 1097.099976, 1106.300049, 1102.410034, 1132.51001 ,\n",
       "       1126.219971, 1131.410034, 1131.829956, 1137.48999 , 1159.849976,\n",
       "       1177.329956, 1172.530029, 1175.079956, 1176.47998 , 1167.829956,\n",
       "       1170.569946, 1162.609985, 1122.      , 1090.599976, 1027.180054,\n",
       "       1081.540039, 1055.410034, 1017.25    , 1048.      , 1045.      ,\n",
       "       1048.949951, 1079.069946, 1088.410034, 1090.569946, 1106.469971,\n",
       "       1116.189941, 1112.640015, 1127.800049, 1141.23999 , 1123.030029,\n",
       "       1107.869995, 1053.079956, 1075.140015, 1099.219971, 1089.189941,\n",
       "       1115.319946, 1136.      , 1163.849976, 1170.      , 1145.209961,\n",
       "       1149.959961, 1154.140015, 1120.01001 , 1099.      , 1092.73999 ,\n",
       "       1081.880005, 1047.030029, 1046.      , 1063.      ,  998.      ,\n",
       "       1011.630005, 1022.820007, 1013.909973,  993.409973, 1041.329956,\n",
       "       1020.      , 1016.799988, 1026.439941, 1027.98999 , 1025.040039,\n",
       "       1040.880005, 1037.      , 1051.369995, 1077.430054, 1069.400024,\n",
       "       1082.      , 1077.859985, 1052.      , 1025.52002 , 1029.51001 ,\n",
       "       1046.      , 1030.01001 , 1013.659973, 1028.099976, 1019.      ,\n",
       "       1016.900024, 1049.22998 , 1058.540039, 1058.099976, 1086.030029,\n",
       "       1093.599976, 1100.      , 1090.      , 1077.310059, 1079.890015,\n",
       "       1061.859985, 1074.060059, 1083.560059, 1065.130005, 1079.      ,\n",
       "       1079.02002 , 1064.890015, 1063.030029, 1067.560059, 1099.349976,\n",
       "       1122.329956, 1140.98999 , 1142.170044, 1131.319946, 1118.180054,\n",
       "       1118.599976, 1131.069946, 1141.119995, 1143.849976, 1148.859985,\n",
       "       1143.650024, 1158.5     , 1175.310059, 1174.849976, 1159.140015,\n",
       "       1143.599976, 1128.      , 1121.339966, 1102.089966, 1120.      ])"
      ]
     },
     "execution_count": 238,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "inputs = dataset_total[len(dataset_total) - len(dataset_test)-60:].values\n",
    "inputs\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 239,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "array([[ 955.48999 ],\n",
       "       [ 966.700012],\n",
       "       [ 980.      ],\n",
       "       [ 980.      ],\n",
       "       [ 973.719971],\n",
       "       [ 987.450012],\n",
       "       [ 992.      ],\n",
       "       [ 992.099976],\n",
       "       [ 990.289978],\n",
       "       [ 991.77002 ],\n",
       "       [ 986.      ],\n",
       "       [ 989.440002],\n",
       "       [ 989.52002 ],\n",
       "       [ 970.      ],\n",
       "       [ 968.369995],\n",
       "       [ 980.      ],\n",
       "       [1009.190002],\n",
       "       [1014.      ],\n",
       "       [1015.219971],\n",
       "       [1017.210022],\n",
       "       [1021.76001 ],\n",
       "       [1022.109985],\n",
       "       [1028.98999 ],\n",
       "       [1027.27002 ],\n",
       "       [1030.52002 ],\n",
       "       [1033.98999 ],\n",
       "       [1026.459961],\n",
       "       [1023.419983],\n",
       "       [1022.590027],\n",
       "       [1019.210022],\n",
       "       [1022.52002 ],\n",
       "       [1034.01001 ],\n",
       "       [1020.26001 ],\n",
       "       [1023.309998],\n",
       "       [1035.      ],\n",
       "       [1035.869995],\n",
       "       [1040.      ],\n",
       "       [1055.089966],\n",
       "       [1042.680054],\n",
       "       [1022.369995],\n",
       "       [1015.799988],\n",
       "       [1012.659973],\n",
       "       [ 995.940002],\n",
       "       [1001.5     ],\n",
       "       [1020.429993],\n",
       "       [1037.48999 ],\n",
       "       [1035.5     ],\n",
       "       [1039.630005],\n",
       "       [1046.119995],\n",
       "       [1045.      ],\n",
       "       [1054.609985],\n",
       "       [1066.079956],\n",
       "       [1075.199951],\n",
       "       [1071.780029],\n",
       "       [1064.949951],\n",
       "       [1061.109985],\n",
       "       [1058.069946],\n",
       "       [1057.390015],\n",
       "       [1051.599976],\n",
       "       [1046.719971],\n",
       "       [1048.339966],\n",
       "       [1064.310059],\n",
       "       [1088.      ],\n",
       "       [1094.      ],\n",
       "       [1102.22998 ],\n",
       "       [1109.400024],\n",
       "       [1097.099976],\n",
       "       [1106.300049],\n",
       "       [1102.410034],\n",
       "       [1132.51001 ],\n",
       "       [1126.219971],\n",
       "       [1131.410034],\n",
       "       [1131.829956],\n",
       "       [1137.48999 ],\n",
       "       [1159.849976],\n",
       "       [1177.329956],\n",
       "       [1172.530029],\n",
       "       [1175.079956],\n",
       "       [1176.47998 ],\n",
       "       [1167.829956],\n",
       "       [1170.569946],\n",
       "       [1162.609985],\n",
       "       [1122.      ],\n",
       "       [1090.599976],\n",
       "       [1027.180054],\n",
       "       [1081.540039],\n",
       "       [1055.410034],\n",
       "       [1017.25    ],\n",
       "       [1048.      ],\n",
       "       [1045.      ],\n",
       "       [1048.949951],\n",
       "       [1079.069946],\n",
       "       [1088.410034],\n",
       "       [1090.569946],\n",
       "       [1106.469971],\n",
       "       [1116.189941],\n",
       "       [1112.640015],\n",
       "       [1127.800049],\n",
       "       [1141.23999 ],\n",
       "       [1123.030029],\n",
       "       [1107.869995],\n",
       "       [1053.079956],\n",
       "       [1075.140015],\n",
       "       [1099.219971],\n",
       "       [1089.189941],\n",
       "       [1115.319946],\n",
       "       [1136.      ],\n",
       "       [1163.849976],\n",
       "       [1170.      ],\n",
       "       [1145.209961],\n",
       "       [1149.959961],\n",
       "       [1154.140015],\n",
       "       [1120.01001 ],\n",
       "       [1099.      ],\n",
       "       [1092.73999 ],\n",
       "       [1081.880005],\n",
       "       [1047.030029],\n",
       "       [1046.      ],\n",
       "       [1063.      ],\n",
       "       [ 998.      ],\n",
       "       [1011.630005],\n",
       "       [1022.820007],\n",
       "       [1013.909973],\n",
       "       [ 993.409973],\n",
       "       [1041.329956],\n",
       "       [1020.      ],\n",
       "       [1016.799988],\n",
       "       [1026.439941],\n",
       "       [1027.98999 ],\n",
       "       [1025.040039],\n",
       "       [1040.880005],\n",
       "       [1037.      ],\n",
       "       [1051.369995],\n",
       "       [1077.430054],\n",
       "       [1069.400024],\n",
       "       [1082.      ],\n",
       "       [1077.859985],\n",
       "       [1052.      ],\n",
       "       [1025.52002 ],\n",
       "       [1029.51001 ],\n",
       "       [1046.      ],\n",
       "       [1030.01001 ],\n",
       "       [1013.659973],\n",
       "       [1028.099976],\n",
       "       [1019.      ],\n",
       "       [1016.900024],\n",
       "       [1049.22998 ],\n",
       "       [1058.540039],\n",
       "       [1058.099976],\n",
       "       [1086.030029],\n",
       "       [1093.599976],\n",
       "       [1100.      ],\n",
       "       [1090.      ],\n",
       "       [1077.310059],\n",
       "       [1079.890015],\n",
       "       [1061.859985],\n",
       "       [1074.060059],\n",
       "       [1083.560059],\n",
       "       [1065.130005],\n",
       "       [1079.      ],\n",
       "       [1079.02002 ],\n",
       "       [1064.890015],\n",
       "       [1063.030029],\n",
       "       [1067.560059],\n",
       "       [1099.349976],\n",
       "       [1122.329956],\n",
       "       [1140.98999 ],\n",
       "       [1142.170044],\n",
       "       [1131.319946],\n",
       "       [1118.180054],\n",
       "       [1118.599976],\n",
       "       [1131.069946],\n",
       "       [1141.119995],\n",
       "       [1143.849976],\n",
       "       [1148.859985],\n",
       "       [1143.650024],\n",
       "       [1158.5     ],\n",
       "       [1175.310059],\n",
       "       [1174.849976],\n",
       "       [1159.140015],\n",
       "       [1143.599976],\n",
       "       [1128.      ],\n",
       "       [1121.339966],\n",
       "       [1102.089966],\n",
       "       [1120.      ]])"
      ]
     },
     "execution_count": 239,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "inputs = inputs.reshape(-1,1)\n",
    "inputs"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 240,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "(185, 1)"
      ]
     },
     "execution_count": 240,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "inputs = sc.transform(inputs)\n",
    "inputs.shape\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 241,
   "metadata": {},
   "outputs": [],
   "source": [
    "#Same thing has to be done for test dataset.\n",
    "x_test = []\n",
    "for i in range(60,185):\n",
    "    x_test.append(inputs[i-60:i,0])"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 242,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "(125, 60)"
      ]
     },
     "execution_count": 242,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "x_test = np.array(x_test)\n",
    "x_test.shape\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 243,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "(125, 60, 1)"
      ]
     },
     "execution_count": 243,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "x_test = np.reshape(x_test, (x_test.shape[0],x_test.shape[1],1))\n",
    "x_test.shape"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 244,
   "metadata": {
    "scrolled": true
   },
   "outputs": [
    {
     "data": {
      "text/plain": [
       "array([[670.1006 ],\n",
       "       [670.63074],\n",
       "       [671.1359 ],\n",
       "       [671.6135 ],\n",
       "       [672.0735 ],\n",
       "       [672.5332 ],\n",
       "       [673.0125 ],\n",
       "       [673.5291 ],\n",
       "       [674.09375],\n",
       "       [674.7105 ],\n",
       "       [675.3818 ],\n",
       "       [676.1102 ],\n",
       "       [676.897  ],\n",
       "       [677.7405 ],\n",
       "       [678.637  ],\n",
       "       [679.58325],\n",
       "       [680.58093],\n",
       "       [681.63403],\n",
       "       [682.7461 ],\n",
       "       [683.9158 ],\n",
       "       [685.13556],\n",
       "       [686.392  ],\n",
       "       [687.66705],\n",
       "       [688.9334 ],\n",
       "       [690.1503 ],\n",
       "       [691.2592 ],\n",
       "       [692.19916],\n",
       "       [692.9229 ],\n",
       "       [693.3922 ],\n",
       "       [693.5839 ],\n",
       "       [693.4951 ],\n",
       "       [693.14105],\n",
       "       [692.5567 ],\n",
       "       [691.7949 ],\n",
       "       [690.91626],\n",
       "       [689.9837 ],\n",
       "       [689.0588 ],\n",
       "       [688.19434],\n",
       "       [687.4329 ],\n",
       "       [686.809  ],\n",
       "       [686.3443 ],\n",
       "       [686.04236],\n",
       "       [685.8823 ],\n",
       "       [685.82556],\n",
       "       [685.8318 ],\n",
       "       [685.86884],\n",
       "       [685.91766],\n",
       "       [685.97546],\n",
       "       [686.0566 ],\n",
       "       [686.18726],\n",
       "       [686.3927 ],\n",
       "       [686.688  ],\n",
       "       [687.0782 ],\n",
       "       [687.55536],\n",
       "       [688.09467],\n",
       "       [688.65826],\n",
       "       [689.2029 ],\n",
       "       [689.68115],\n",
       "       [690.04614],\n",
       "       [690.2617 ],\n",
       "       [690.29785],\n",
       "       [690.1288 ],\n",
       "       [689.74133],\n",
       "       [689.13715],\n",
       "       [688.32697],\n",
       "       [687.3344 ],\n",
       "       [686.1951 ],\n",
       "       [684.9484 ],\n",
       "       [683.6328 ],\n",
       "       [682.287  ],\n",
       "       [680.9442 ],\n",
       "       [679.63684],\n",
       "       [678.3924 ],\n",
       "       [677.2352 ],\n",
       "       [676.191  ],\n",
       "       [675.2834 ],\n",
       "       [674.53296],\n",
       "       [673.9533 ],\n",
       "       [673.54407],\n",
       "       [673.2855 ],\n",
       "       [673.14374],\n",
       "       [673.08185],\n",
       "       [673.06647],\n",
       "       [673.0656 ],\n",
       "       [673.0513 ],\n",
       "       [673.0036 ],\n",
       "       [672.905  ],\n",
       "       [672.7557 ],\n",
       "       [672.56793],\n",
       "       [672.36163],\n",
       "       [672.1669 ],\n",
       "       [672.0173 ],\n",
       "       [671.9459 ],\n",
       "       [671.979  ],\n",
       "       [672.12854],\n",
       "       [672.3917 ],\n",
       "       [672.75323],\n",
       "       [673.19   ],\n",
       "       [673.67847],\n",
       "       [674.1966 ],\n",
       "       [674.7255 ],\n",
       "       [675.2504 ],\n",
       "       [675.7597 ],\n",
       "       [676.2391 ],\n",
       "       [676.67816],\n",
       "       [677.07623],\n",
       "       [677.4451 ],\n",
       "       [677.81116],\n",
       "       [678.2069 ],\n",
       "       [678.6617 ],\n",
       "       [679.19226],\n",
       "       [679.79956],\n",
       "       [680.4765 ],\n",
       "       [681.2145 ],\n",
       "       [682.00397],\n",
       "       [682.83636],\n",
       "       [683.7041 ],\n",
       "       [684.6002 ],\n",
       "       [685.521  ],\n",
       "       [686.4666 ],\n",
       "       [687.43835],\n",
       "       [688.4251 ],\n",
       "       [689.40857],\n",
       "       [690.3627 ],\n",
       "       [691.25586]], dtype=float32)"
      ]
     },
     "execution_count": 244,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "# Predict on the test windows, then undo the scaling to get prices\n",
    "predicted_price = regressor.predict(x_test)\n",
    "predicted_price = sc.inverse_transform(predicted_price)\n",
    "predicted_price\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 245,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "image/png": "\n",
      "text/plain": [
       "<Figure size 432x288 with 1 Axes>"
      ]
     },
     "metadata": {
      "needs_background": "light"
     },
     "output_type": "display_data"
    }
   ],
   "source": [
    "#Comparing real price and predicted price\n",
    "plt.plot(real_stock_price,color = 'red', label = 'Real Price')\n",
    "plt.plot(predicted_price, color = 'blue', label = 'Predicted Price')\n",
    "plt.title('Infosys Stock Price Prediction')\n",
    "plt.xlabel('Month')\n",
    "plt.ylabel('Stock Price')\n",
    "plt.legend()\n",
    "plt.show()"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 222,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "image/png": "\n",
      "text/plain": [
       "<Figure size 432x288 with 1 Axes>"
      ]
     },
     "metadata": {
      "needs_background": "light"
     },
     "output_type": "display_data"
    }
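   ],
   "source": [
    "# Compare real and predicted prices, with the predictions scaled by 0.8\n",
    "# NOTE: the 0.8 factor is a manual scaling, not something the model produces\n",
    "plt.plot(real_stock_price, color='red', label='Real Price')\n",
    "plt.plot(predicted_price * 0.8, color='blue', label='Predicted Price (x0.8)')\n",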
    "plt.title('Infosys Stock Price Prediction')\n",
    "plt.xlabel('Month')\n",
    "plt.ylabel('Stock Price')\n",
    "plt.legend()\n",
    "plt.show()"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {},
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      " * Serving Flask app \"__main__\" (lazy loading)\n",
      " * Environment: production\n",
      "   WARNING: This is a development server. Do not use it in a production deployment.\n",
      "   Use a production WSGI server instead.\n",
      " * Debug mode: off\n"
     ]
    },
    {
     "name": "stderr",
     "output_type": "stream",
     "text": [
      " * Running on http://127.0.0.1:5000/ (Press CTRL+C to quit)\n",
      "127.0.0.1 - - [03/Mar/2020 12:05:27] \"GET / HTTP/1.1\" 302 -\n",
      "127.0.0.1 - - [03/Mar/2020 13:54:09] \"GET / HTTP/1.1\" 302 -\n"
     ]
    }
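   ],
   "source": [
    "from flask import Flask, redirect\n",
    "\n",
    "app = Flask(__name__)\n",
    "\n",
    "# Redirect the root URL to the externally hosted index.php page\n",
    "@app.route('/')\n",
    "def home():\n",
    "    return redirect('https://predictionsystem1234.000webhostapp.com/index.php')\n",
    "\n",
    "if __name__ == \"__main__\":\n",
    "    app.run()  # starts Flask's development server on http://127.0.0.1:5000/"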
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {},
   "outputs": [],
   "source": []
  }
 ],
 "metadata": {
  "kernelspec": {
   "display_name": "Python 3",
   "language": "python",
   "name": "python3"
  },
  "language_info": {
   "codemirror_mode": {
    "name": "ipython",
    "version": 3
   },
   "file_extension": ".py",
   "mimetype": "text/x-python",
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
   "version": "3.7.4"
  }
 },
 "nbformat": 4,
 "nbformat_minor": 2
}
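The notebook builds its test input by sliding a 60-step window over the scaled series and then reshaping the result from (125, 60) to (125, 60, 1), the 3-D layout the recurrent model is fed. The sketch below reproduces only that shape handling on synthetic data. The helper name `make_windows` and the 185-row stand-in for `inputs` (125 test rows plus 60 look-back rows) are assumptions for illustration, not code from the notebook.

```python
import numpy as np

WINDOW = 60  # look-back length used in the notebook


def make_windows(series, window=WINDOW):
    """Turn a 1-column series into samples of shape (n_samples, window, 1).

    Hypothetical helper: sample k holds rows k .. k+window-1 of the series,
    mirroring the notebook's `inputs[i-60:i, 0]` slicing.
    """
    series = np.asarray(series, dtype=np.float32).reshape(-1, 1)
    if len(series) <= window:
        raise ValueError("series must be longer than the window")
    rows = [series[i - window:i, 0] for i in range(window, len(series))]
    x = np.array(rows)                            # (n_samples, window)
    return x.reshape(x.shape[0], x.shape[1], 1)   # (n_samples, window, 1)


# Synthetic stand-in for the scaled `inputs` array (assumed length: 125 + 60).
inputs = np.linspace(0.0, 1.0, 185).reshape(-1, 1)
x_test = make_windows(inputs)
print(x_test.shape)  # (125, 60, 1)
```

The trailing axis of size 1 is the feature dimension: each time step carries a single value (one price column), so adding more input columns would change only that last number.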
View on Github
Stock Market Prediction
