AIJ-200 American International Journal of Contemporary Scientific Research ISSN 2349 – 4425

Query search on assortment websites by GMDH using neural network while not CRAWL to get rid of pretend rank websites

Zuber Ansari

Department of Information Technology, TIT Excellence, RGPV University, Bhopal – 462001

[email protected]

ABSTRACT Internet is an assortment of immense data. In the modern internet era everyone search on internet for some information. To explore the internet Search Engines are primary targets. Everyone uses Search Engine from find a name of website for information. Search Engines have primary responsibility to search query related information and rank them according to relevant information. This vast need of Search Engines makes them an important tool for this challenging task. Search engine directly compare keywords with URI, , CONTENT, REFBACK LINK, web pages against the user query and a web page which has most matched keyword density, results high rank in Search Engine results. In that case many websites want to show their websites on first page against user query search keywords. To get first page ranking Search Engine Optimization is a method to boost ranking of website. Search Engine Optimization is a technique that by pass the Google search results rules as well as shows as first page rank website to the crawler when crawl web pages of website. In this era many fake ranking websites show on first page with irrelevant to keyword as well as user mistrusted websites. In this project we using a neural network method to filter query on website group or set of group and predict relevant website after that search on that website for query. As well as this proposed system work without crawler.

Keywords: SEO, SEF, KEYWORD, URI, METADATA, CONTENT, REFBACK LINK, Crawler, Crawl

1. INTRODUCTION Search Engine plays an important role to extract information from internet in the form of ranked web pages result. Searched results top few results got hits form users and low ranked pages. Web page ranking can be improved with Search Engine Optimization, which helps in analysis of Search Engine rules and develop a web page according to Search Engine ranking rules. SEO is a practice which helps to improve web page’s popularity as well as content, which easily recognized by Search Engine. But judge a web page’s quality and popularity is totally a responsibility of Search Engine. Every Search Engine works with some searching algorithms and strategies. On the behalf of searching techniques Search Engine can be divided into two major categories: Keyword Based Search and Semantic Search. Keyword Based Search focuses on the web pages keywords and directly matched with query keywords. Before starting search on internet Search Engines register every web page with their personal database. Search Engines spider are located over the web and make searching of web pages on web easier. Every Search Engine firstly parse the web pages, ignore all the stop words like „the‟ „is‟ „am‟ „are‟ etc. Every article lies between and tags, where the page id, title, and text is separated by the corresponding tags. [2] [18]

Interrogative words are removed from the query and page ranking is performed on the basis of main keywords of the query. Every Search Engine performs following three tasks:

A. Inverted Index: After parsing all web pages documents Search Engine Create an inverted index for all documents which speed up information retrieval. Example in below table „computer‟ is a term for inverted index and corresponding „ [1, [0, 5, 15]] ‟ is an index for „laptop‟ time period in inverted index database. Here 1 factor to net web page index and [0, 5, 15] factors to term’s index inside web page. Finally search starts towards inverted indexed database. [12] [21]

11| www.yloop.in AIJ-200 American International Journal of Contemporary Scientific Research ISSN 2349 – 4425

(TERM) (DOCID) Computer [ [1, [0, 5, 15]], [2[0, 7, 17]] Literature [[4, [0, 6, 13]], [5[0, 19]] Spirituality [[7, [0, 9, 23]] Search Engines are mainly three types: Full Text, Meta and Directory Search Engine. Search engine have two major searching techniques used to search web pages on the Internet and store them into Search Engine’s personal database and ranking in SERP. [4]

B. Query Analysis: User puts a query to search some type of information on the internet. Question works as a feed to look Engine, and explicit consumer s view for seek. On the idea of query keywords question can be divided into three classes: Navigational, Transactional and Informational. Query evaluation is approaches which refine question with shorten key phrases. Interrogative phrases are removed from the question and web page ranking is performed on the idea of primary key phrases of the question. [4]

C. Ranking Web Pages: Finally Search Engine rank a web page with some predefined policy, strategies. Ranking a web page may depend on several factors of web pages like Popularity, contents, structure of web pages etc. After search on personal database Search Engine finally rank web pages according to their content frequency matched with query keywords. [4]

On the premise of query all search engines may be categorize as keyword based Search Engine or Semantic search Engine. Both types of Search Engines follow all the upper steps but difference is based on the searching techniques. [4]

Keyword Based Search: This search mainly focus on the query keywords and match theses query keywords directly with the web page keywords without any proper query analysis or knowledge. Search mainly points out the title , anchor text, page heading tag, Meta tag’s keywords, anchor tag, ALT tag, URL. To optimize a web page in search result keywords density, distribution and keywords selection is more important. Every value in a field considered as a small text document that can be used for the Keyword Based Search. Keyword matching process finds a set of tuple in database, which has matched keywords in user query. With the help of inverted index it is easy to determine matched keywords and their index in a webpage. A web page which has exact query matched keywords and has more strength of query related keywords gets high rank in search result. Because of Search Engine gives priority to keywords so Black Hat techniques. Keyword Based Search Engines are Google, Yahoo, and MSN etc. [15] [17] [19]

Semantic Search: This is also called a search with meanings. The Semantic Search determines the content and contextual meaning of query keywords. The Semantic Search Engine interpreted extracting the relevant concept from the sentence. A query also itself determines the user’s view. Interrogative words are used to change the meaning of query like what, why, when etc. The Semantic Search Engines are an alternative to the Keyword Based Search Engines. The distinction of Semantic Search Engines from Conventional Search Engines is that the Semantic Search Engines are which means-based totally. Semantic search is combined with conventional key-word-primarily based retrieval to reap tolerance to understanding base incompleteness. The Semantic Search uses a knowledge base to extend the user query. A user uses shortcut keywords but want more results, so this is the Search Engine’s responsibility to extend the user’s query to find relevant results to user query. Knowledge base is a group of related terms which has similar type of related terms. Any keyword which exists in knowledge base can be a part of user query indirectly. Knowledge base mostly may be in form of database with different related terms collection. Each term has pre assigned weight in a database. Term’s weight further used to calculate the web page’s weight for ranking in search results. The Semantic Search also retains the exact meaning of query and relate with results. [16] [20]

1.1 SEARCH ENGINE OPTIMIZATION SEO is the manner to get your site higher ranking to show on first page in search results of search engines. SEO is a necessary service as the Internet is evolving at a rapid pace and there is greater than before competition for market go halves to the top positions in various search engines to reach. Through research help, it was shown that most people prefer the first ten search outcome as if they traverse the list of search engines and rarely go for results. SEO is one of the most overlooked areas of marketing today. You are having the older generation still trying to advertise on billboards, TV commercials, advertisements or radio. In the meantime the younger age group consumed with the hype of Face book, Twitter and additional sites of social networking. There are lots of SEO techniques with best quality that are able to achieve the desired results. SEO is the best publicity method that is much longer term. You can never climb in the rankings of Google within days. Typically it takes several months before seeing a boost in your position. But on one occasion you can even get to the peak of the search engines and you can even receive a constant stream of hundreds. The objective of this research paper was to prove that implementing search engine optimisation elements that are increasing Website traffic and online visibility .For Website usability seo attributes is essential to improve rankings [14]

12| www.yloop.in AIJ-200 American International Journal of Contemporary Scientific Research ISSN 2349 – 4425

1.2 HOW DOES A SEARCH ENGINE WORK? Search engines perform several activities in order to deliver search results. [14]

 Crawling - Process of fetching all the web pages linked to a website. This task is performed by software called a crawler or a spider (or Google bot, in case of Google).  Indexing - Process of creating index for all the fetched web pages and keeping them into a giant database from where it can later be retrieved. Essentially, the process of indexing is identifying the words and expressions that best describe the page and assigning the page to particular keywords. Processing -When a search request comes, the search engine processes it, i.e., it compares the search string in the search request with the indexed pages in the database.  Calculating Relevancy-It is likely that more than one page contains the search string, so the search engine starts calculating the relevancy of each of the pages in its index to the search string.  Retrieving Results-The last step in search engine activities is retrieving the best matched results. Basically, it is nothing more than simply displaying them in the browser. Search engines such as Google and Yahoo! often update their relevancy algorithm dozens of times per month. When you see changes in your rankings, it is due to an algorithmic shift or something else beyond your control. Although the basic principle of operation of all search engines is the same, the minor differences between their relevancy algorithms lead to major changes in the relevancy of results. 1.3 WHAT IS SEARCH ENGINE RANK? When you search any keyword using a search engine, it displays thousands of results found in its database. A page ranking is measured by the position of web pages displayed in the search engine results. If a search engine is putting your web page on the first position, then your web page rank will be number 1 and it will be assumed as the page with the highest rank.SEO is the process of designing and developing a website to attain a high rank in search engine results. [14]

2. SEO PROCSESS There are several segments of SEO strategy seen as optional that are actually absolutely imperative to the success of an SEO campaign [3] [5] [6] [7]

 Website analysis.  Competitive Analysis.  Proper Keyword Research.  Content Development.  On-page Optimisation.  Off-page Optimisation.  Return on Investment Analysis.  Local SEO.  Daily, weekly and monthly reports.

Fig 2.1 show traffic drive different category TYPES OF SEO: 2.1 On page optimization

2.2 Off page optimization 13| www.yloop.in AIJ-200 American International Journal of Contemporary Scientific Research ISSN 2349 – 4425

2.1 ON PAGE OPTIMIZATION On page optimization is the process of making a website SEO friendly and lot of points keep in mind when we doing the on page of any website like title tag optimization, meta description of that website, navigation structure of a website and more. [3] [5] [6] [7] 2.2 OFF PAGE OPTIMIZATION Off Page Optimization is the activity that is done on other sites for our site to increase search engine ranking. Off page is what you do to promote your website like link building, , posting, directory submission, smo etc. SEO techniques are classified into two broad categories: White Hat SEO – Techniques that search engines recommend as part of a good design. Black Hat SEO – Techniques that search engines do not approve and attempt to minimize the effect of. These techniques are also known as spam indexing. Latest SEO Techniques to Drive Targeted Traffic to Your Website Even though people spend a lot of their online time on social network sites , search engines like Google, Bing and Yahoo are still the primary drivers of traffic to lots of websites. Approximately 80% of traffic on the internet is due to search engines. One of the best SEO techniques is Keep Updating Your website as search engines index those websites quickly& rank them higher which are updated regularly. After Google Panda Update content is termed as King to rank higher on the search engines. Be careful about the density of keywords and phrases and maintain a balance between what visitors and search engines looking, you need to always talk about your blog or website. Organizing online contests giving a prize for the best web design, or the best design for a logo, always gives publicity to the organizer. So use your own ways to be famous as this will increase traffic to your website. Publish about your website or blog in newspapers, magazines, etc., Attend conferences and seminars. If somebody read your article in the newspaper is very probable that visits your site. Word of mouth is still an effective technique to promote your website. [3] [5] [6] [7]

Fig 2.2 show traffic drive different category 2.3 HOW DOES MODERN SEO TECHNIQUE WORK? Search engines have existed for about 2 decades now, and many people know how they work. Up until a few years ago, SEO used to be a simple case of finding a some keywords and then building links based on them. This resulted in a lot of spam and low quality search results on Google. [8] [9] [10] [11]

Fig 2.2 show how search engine work 14| www.yloop.in AIJ-200 American International Journal of Contemporary Scientific Research ISSN 2349 – 4425

Search engines have updated their algorithms to detect unethical SEO tactics. Most websites that used shady methods to increase their ranks on search engines were severely penalized. Here are some SEO techniques that used to work but do not anymore: queried.

 Building links through link farms  Building links through content farms  Stuffing articles and web pages with keywords  Building an excessive number of links in a short period of time  Using various article marketing techniques and ignoring all other legitimate techniques

There are better ways to optimize your website and make them more popular on search engines. Here are some techniques that do work.

Guest posting: This is a very recent technique and it has grown very popular over the last 3-4 years. Guest posts are published on websites and other than yours. They have interesting content for readers of the website. 2.4 WHY DO I NEED GUEST BLOGGING? Positive and constructive comments from consumers help maintain a respectable business image, and they are also a valuable advertising method. Improve your site’s rankings and increase your incoming traffic. The guest post contains links to your website that readers can follow Your website is exposed to a wider audience, which can drive further traffic Guest posts are usually of higher quality. Guest posting improve rank through search engine holders. [8] [9] [10] [11] 2.4.1 BLOGGING Even though social network has supplemented blogging to a large extent, it is still the best way to increase readership. Social network posts are very casual and they are great for conversations. However, when you post blogs that have genuine and interesting content, people are more interested in reading them. 2.4.2 FACEBOOK ACTIVITY Facebook is a constant in people’s lives. They use the network to find out what’s going on around the world. They use it to find interesting news and you can use this to promote your products as well. Posting regular discounts and offers on Facebook will garner more attention and drive traffic to your website. 2.4.3 LEVERAGING TWITTER Twitter is probably the most impactful SEO tool in the last 3 years. It started as a small status update network, Tweets are short and they are more easily digestible for visitors. 2.4.4 LINK BUILDING Along with proper keyword research, link building forms one of the pillars of search engine optimization. The basic principle used by all search engines to gauge your popularity is still counting the number of links that point to your website. However, the rules have changed considerably over the last 3 years. Here is what search engines want from your website: Back links to your website must be organic (and not part of an excessive strategy) Links should appear on quality websites (as determined by their Page Rank)If many links are found on your website on link farms and content farms, Google and Bing will downgrade your rankings severely High quality link building will lead your web pages to appear in the top 10 or 20 search results on search engines 2.4.5 USING NICHE LONG TAIL KEYWORDS People increasingly search on mobile devices. Earlier, people used to only look for information about certain topics on the internet. Today, they use it to look for: Products and services Nearby restaurants, movies and other places Search is becoming increasingly localized. Therefore, if your business is based on Melbourne, you need to include those locations in your keyword research as well. Examples include: Red running shows in Melbourne (rather than simply “red running shows”)You need to use generic keywords as well as long tail niche keywords as mentioned above. This will help your website appear on top when people are looking for local places and businesses. 2.4.6 UPGRADE YOUR WEBSITE The last 3 years have seen modern web standards including HTML5 and CSS3 emerge. These technologies make it easier for search engines to parse your website content and index information. Upgrading our website is the single best thing you can to do increase your visibility on search engines. 2.4.7 MAINTAIN YOUR LINKS As time goes on, many websites are simply not updated. Some websites go down. Even in existing websites, some pages are deleted. However, the links to these pages are not always taken down. These are called dead links and the resulting state is called 1| www.yloop.in AIJ-200 American International Journal of Contemporary Scientific Research ISSN 2349 – 4425 link rot. You need to ensure that all the links on your websites are active and that dead links are removed. You do not need to go hunt for individual links because there are many tools that can help you automate this process. Running through this process every week will help ensure that search engines always find your website to be high quality. 2.4.8 UPDATE YOUR WEBSITE ACTIVITY Lastly, search engines want websites to be active. If your site is not regularly updated with new content, it is considered to be inactive and therefore of little value to search engine users. Keep your website fresh with new posts and design, so Google and other engines will index your website on a daily basis. This way, users will also get the latest content on your website from search engines. SEO has indeed become a very time consuming job, but that is because there are so many factors in play. You might want to hire a reliable SEO company to help you form a concrete strategy. 2.5 WHY IS SEO IMPORTANT? As an SEO expert, we are well acquainted with the fact that developing your business online is not just having a website. SEO is like electrical energy for your online firm, and making business through your site does not take place on its own. Most of the users who are in search of your products or services utilize a Search Engine such as Google to come across you and your contenders. Now, if your website does not have an ongoing / current SEO strategy, you obstruct the ability for your site to be located over the search engines like Google. If your opponents are becoming visible more than you on key terms relating to your business, do not forget to ask your SEO advisors as to what all can be done regarding it. 2.5.1 WHY DO WE NEED ONGOING SEO? As an SEO expert, we are well acquainted with the fact that developing your business online is not just having a website. SEO is like electrical energy for your online firm, and making business through your site does not take place on its own. Ongoing SEO is important because the search engine industry changes very often. It helps organizations to proactively build links over time. It helps you to frequent view and analyze the analytics. Regular reporting and evaluation ensure objectives are being achieved or at least moving in the right direction. 2.5.2 IS ON-PAGE SEO MORE IMPORTANT THAN OFF-PAGE SEO? On-page is anything you directly control in the code or content of your site. On-Page or on-site optimization can be beneficial initially as you are looking for good rankings for your website or blog and can be crucial when you are close to getting that ultimate number one position in Google. But all through the process of getting your site to number 1 you should still pay close attention to on-site optimisation. However it is Off-Page or off-site optimization that is really critical to getting good rankings in www.google.in order to start thinking on how you can promote your website you need to be sure that the website is optimized and in good condition. So the first step is to work with on -site SEO first and then go off-site. All your website content – pages, blogs, articles, etc. Title tags, meta tags, and H1 tags Your user interface, layout, and all aspects of design Alt text for your images The keywords you use, and their usage density (do not assume more is better!)How fresh, unique, and current your content is The time it takes for your website to load URL structure, and whether or not the contain your keywords XML sitemap – it should be comprehensive and dynamic A full indexing of your site (you can select pages that you do not wish to be indexed by search engines) Internal linking (linking to other pages within your site) and navigationRobots.txt file (this tells the search engines to index your site)301 redirects 2.5.3 OFF-SITE SEO is therefore all the link building and marketing techniques you employ outside the context of your own website. These include (but are not limited to):Link building; back-linking and inbound Press releases and other media articles that directly reference your website Any article directory submissions you have made Competitor analysis Your social media posts and related marketing Keyword analysis – the popularity and accuracy of your keywords Leaving comments and links on other sites, blogs, forums, and videos Submissions to SEO-friendly directories Videos placed on sites like YouTube, properly optimized RSS feeds AdWords and other paid keyword campaigns Track backs IV. 2.6 ADVANTAGES OF SEO FREE TARGETED TRAFFIC The primary advantage of SEO over any other type of internet marketing. Once your site is ranked in the top 10 for your keyword phrase, you should be able to keep it there with a minimum of fuss. This means that you will receive free traffic as long as you keep it up 2.6.1 SUPERB ROI Return on investment is one of the major advantages of SEO over paid advertising. A site takes some time to get rank and after the site ranked, the ROI will be great. 2.6.2 COST EFFECTIVENESS SEO is one of the most cost-effective ways of marketing. If the site is properly designed and optimized then it has a longer standings s compared to the Pay Per Click Advertising. 2| www.yloop.in AIJ-200 American International Journal of Contemporary Scientific Research ISSN 2349 – 4425

2.6.3 BETTER USABILITY The site is easily available to the large portion of the online users. A better optimized and designed website always attracts lot of visitors 2.6.4 HIGHER SALES Increased visibility, cost effectiveness and accessibility leads to the higher sales. 2.6.5 60% OF CLICKS GO TO THE FIRST RESULT This means that only 40% of clicks are left for the second through the millionth result on google for the keyword. Securing that top spot on Google is a sure fire way to gain thousands upon thousands of visitors. SEO is certainly the tool needed to gain that top spot as well. For a small up front investment you are looking at potentially millions of sales. 2.6.6 THE RESULTS ARE LOW COST (IN COMPARISON TO ADWORDS AND PPC)

Organic listings are essentially free. When you are listed at the top, you don’t need to pay per click or allocate a budget for advertising, one of the main benefits of SEO is that it is the gift that keeps on giving. With a little bit of effort (an d some money upfront to pay for SEO costs) you can watch your website get consistent traffic. You don’t have to pay $10 for every person who clicks on your ad. Unlike paid ads, your traffic will not drop to nothing when it stops. SEO gets rid of the need to have thousands of ads across the web.

3 LITERATURE SURVEY Here some search engine optimization algorithm below: 3.1 ZHOU HUI Highlight sin “Study on Website Search Engine Optimization”, the factor effecting searches ranking of search engine optimization. This paper presents the three factor Webpage correlation, Link Weight and Time Based Factors of SEO with three types of Search Engines Full Text Search Engine, Meta Search Engine and Directory Search Engine. According to this paper Keyword optimization can be done with Keyword Selection, Keyword Density and Keyword Distribution. A web page optimization can be described with both internal and external links which purely describe web page popularity. An effective way to optimize a web page is to submit it to Search Engine or submit to open directory library for future indexing. [4] 3.2 -TSAI CHUNG A web server design using search engine optimization techniques for web intelligence for small organizations” this research paper highlights the ideal model for developing web servers for small organizations by including search engine optimization techniques. As we now today is a globalization era, a new way of business and marketing is developed by the use of Internet web services. Search engine optimization is the process of improving the visibility of a website I the search engine result via “natural” search result. Small organization such as schools, banks, government agencies, libraries, retailers, restaurants, post offices could build their web services in higher quality for improving their business in surviving in today’s competitive world. [1] 3.3 S. G. CHOUDHARY Semantic search algorithm based on page rank and ontology: A review this research paper introduces some Semantic techniques and provides different algorithms based on page ranking and ontology. The algorithms based on Semantic Search are the following ways: PSSE: Personalized Semantic Search Engine use the user‟s profile with the ranking score and ontology for calculation of personalized factor which helps to get more personalized result. Google search engine use user profile to personalized search result for their location and on the basis of their previous search interest result. The architecture of PSSE has two parts: Offline and Online. The Offline part consists of crawling and pre-processing processes. [20] 3.4 SHIKHA GOEL “Search Engine Evaluation Based on Page Level Keywords” this Paper highlights the approach of search engine evaluation which is based on page level keywords. Page level keywords are the keywords found in individual pages of website. Page level keyword is an impotent factor to measurethe relevance of search engine results. A user create a query and search engine designer design the database for this query and later the queries are run by the users to calculate the page level keywords. [13] 3.5 PHYO THU THU KHINE Keyword Searching and Browsing System over Relational Database this paper highlights the searching of keywords in relational database for increase the searching speed of desired keywords. A user doesn’t need the knowledge of database schema or SQL. A

3| www.yloop.in AIJ-200 American International Journal of Contemporary Scientific Research ISSN 2349 – 4425 user submit a list of keywords them system search for the relevant records and ranking them on their occurrence basis. Three systems DISCOVER, BANKS and DB Xplorer user keyword based search over relational database. A query gives a set of keywords and finds rows in relation database for keywords. Indexing Relational Database: Indexing is used to speed up the retrieval of records. Indexing is useful when database has large number of Text fields. Each value in such a column considered as a small text documents that can be used for keyword based search. Query Cleaning: System takes a query as input and produces a „clean‟ query output. This is achieved by filtering the stop words from query. These words are meaningless. So the result occur with them may not satisfy the user. Keyword matching: Once the cleaned query is produced, this system can match the keywords. System matches the query keywords with database tuples. A keyword matching algorithm may different for multiple keywords queries. Record Scoring: After query results, calculation of the score for each result is need. The record is determines which record is relevant to user query. This process is also called ranking process of documents in result. [17] [23] 3.6 DUYGU TUMER An Empirical Study on Semantic Search Performance of Keyword-Based and Semantic Search Engine: Google, Yahoo, Msn and Hakia this paper analyzes the semantic search performance of search engines. This paper took three keyword based search engines Google, Yahoo, Msn and a Semantic search engine Hakia. Different queries of different topics analyze the performance of these search engines. Web search engines are computer programs which allow users to search their desired information from websites. The most popular search engines are Google, Yahoo, and Msn with 71.9, 71.7% and 4.2 volume of search ratio respectively. Hakia is the publicly available semantic search engine. This paper has a table with ten different types of queries. These queries were run on the both keyword-based search engine as well as semantic search engine. Keywords were used to replace phrase. Yahoo and Msn retrieved approximately 75.5%, 63% and 78% non-relevant documents respectively. Hakia retrieved 62.5% non- relevant documents

Both Keyword Based Search and Semantic Search deal with query keywords and web pages keywords. A Keyword is a key point in both searching techniques. A Comparative analysis of both techniques will show the merits and demerits of both techniques against each other. Comparison can be done on the basis of Time taken by a particular search and Accuracy of produced results against query. For analysis we have taken 20 keywords as shown I table 1for query to both types of searches against hundred different web pages used for experimental results. A combination of more than one keyword can be used to find relevant results. [22]

Sr. Query Keywords 1 Compute 2 Computer 3 Search 4 Density 5 Poem 6 Essay 7 RAM 8 math 9 Research 10 Cricket 20 IPL 11 ROM 12 science 13 Literature 14 Spirituality 15 Spiritual science1 16 Literature is science1 17 GOD 18 English Literature 19 SEO 20 Punjabi mp3 TABLE 3.1: Selected Query Keywords

All above mention keywords we put one by one on both searching techniques and store experimental results separately for time and accuracy. We apply all upper keywords in the following Keyword Based Search algorithm one by one for experimental results. Every query keywords directly matched with web page keywords in Search engine inverted indexed database. 3.8 ALGORITHM FOR KEYWORD BASED SEARCH The algorithm describing the sequence of computation steps involved in present work is given below [13]:-Search KeywordBasedSearch(string KEYWORD)

1. Read a key from the user input. 2. Pickup the page from the page repository.

4| www.yloop.in AIJ-200 American International Journal of Contemporary Scientific Research ISSN 2349 – 4425

3. Match each key with the entire existing keyword list with each web page database. 4. Calculate the frequency of keyword in the page. 5. Show keyword frequency in keyword table. 6. Repeat the step from 2 to 5 for each key. 7. Find a page with high frequency of keywords in a keyword table. 8. Show results from high to low frequency keyword pages.

The Semantic Search uses a knowledge base to extend the user query keywords. Knowledge base is a collection of related terms, so thousands of related terms stored in knowledge base previously. A web page’s rank in the Semantic Search can be calculated with following algorithm. [13] 3.9 ALGORITHM FOR THE SEMANTIC SEARCH The algorithm describing the sequence of computation steps involved in present work is given below [13]:-Search SemanticSearch(string KEYWORD)

1. Read a key from the user input 2. Match it from the entire existing keyword list in the knowledge base. 3. Pickup the page from the page repository. 4. Match each key with the entire existing keyword list in each web page database. 5. Calculate the frequency of keyword in a web page. 6. Show keyword frequency in keyword table. 7. Repeat the step from 2 to 5 for each key. 8. Find a page with high frequency of keywords in a keyword table. 9. Show results from high weighted pages to low weighted pages.

4 PROPOSED WORK To explore the internet Search Engines are primary targets. Internet is an assortment of immense data. In the modern internet era everyone search on internet for some information. Everyone uses Search Engine from find a name of website for information. Search Engines have primary responsibility to search query related information and rank them according to relevant information. This vast need of Search Engines makes them an important tool for this challenging task. Search engine directly compare keywords with URI, METADATA, CONTENT, REFBACK LINK, web pages against the user query and a web page which has most matched keyword density, results high rank in Search Engine results. In that case many websites want to show their websites on first page against user query search keywords. To get first page ranking SEO is a method to boost ranking of website. In this era many fake ranking websites show on first page with irrelevant to keyword as well as user mistrusted websites. In this project we using a neural network method to filter query on website group or set of group and predict relevant website after that search on that website for query.

We intend to develop a technique to search keywords on direct targeted group websites without Crawl web pages of website, as well as we intend to develop a search process that search keywords on that website with or without ranking based search that get rid of pretend rank websites results. This research is based on reviewing different available techniques for optimizing individual web-pages or the entire website to make them search engine friendly. Hera they use some technique on page and off page for improve ranking site and presented some challenging problem faced by existing search engines. There are many of On-Page and Off-Page Search Engine Optimization Techniques.

An effective and accurate system for search engine optimization for obtaining a higher rank for the websites in the search results webmaster how top rank their website using Google panda search engine update to enhance their website web page ranking in top position. Both techniques are directly depends on user query keywords. A user’s query keyword(s) works as feed for these both techniques, only text is a way to be searched for, no image or graph search support. The Semantic Search is more effective then the keyword search. But a time complexity of the Semantic Search is a big issue. The Keyword based search is also effective if user has some knowledge about searching fields. Otherwise the keyword search produced keyword bounded results which may not appropriate for a user. With same keywords both techniques produces different results. The Semantic Search uses knowledge base and produces more relevant results. Future scope for this research is to refine the Semantic Search time complexity. A mixture of both techniques can be defined to take advantages of both techniques. With the mixture of both techniques there are possibilities to overcome the Black Hat techniques which bypass the Search Engine rules.

5 APPLICATIONS OF PROPOSED SYSTEM 1. There is no crawler required to crawl each and every web pages of websites to crawl. 2. There is no need to find out internal links as well external links of websites. 3. There is no need of stored database of crawl web pages Meta data information. 4. Huge amount of crawler bandwidth save for both parties, Search Engine as well as Websites. 5| www.yloop.in AIJ-200 American International Journal of Contemporary Scientific Research ISSN 2349 – 4425

5. Get rid of pretend rank websites results. 6. This proposed system includes only result oriented web pages for keyword in search results as well as exclude fake ranking web pages for keyword in search result. 7. This proposed system also work for Plagiarism Checker for research field as well as ranking of one words, two words, three word, four words, five words, and N words plagiarism. 8. Required a crawler to crawl at least website submission as well as required a webmaster tool for keyword generation. 9. Required structured data format in web pages to search keywords on web pages of websites. 10. This proposed system work only for limited area of search queries like Research Area, Medical Research, Publication, Academic, and indexing abstracting system etc.

CONCLUSIONS We intend to develop a technique to search keywords on direct targeted group websites without Crawl web pages of website, as well as we intend to develop a search process that search keywords on that website with or without ranking based search that get rid of pretend rank websites results. This research is based on reviewing different available techniques for optimizing individual web-pages or the entire website to make them search engine friendly. Hera they use some technique on page and off page for improve ranking site and presented some challenging problem faced by existing search engines. There are many of On-Page and Off-Page Search Engine Optimization Techniques.

An effective and accurate system for search engine optimization for obtaining a higher rank for the websites in the search results webmaster how top rank their website using Google panda search engine update to enhance their website web page ranking in top position. Both techniques are directly depends on user query keywords. A user’s query keyword(s) works as feed for these both techniques, only text is a way to be searched for, no image or graph search support. The Semantic Search is more effective then the keyword search. But a time complexity of the Semantic Search is a big issue. The Keyword based search is also effective if user has some knowledge about searching fields. Otherwise the keyword search produced keyword bounded results which may not appropriate for a user. With same keywords both techniques produces different results. The Semantic Search uses knowledge base and produces more relevant results. Future scope for this research is to refine the Semantic Search time complexity. A mixture of both techniques can be defined to take advantages of both techniques. With the mixture of both techniques there are possibilities to overcome the Black Hat techniques which bypass the Search Engine rules.

6| www.yloop.in AIJ-200 American International Journal of Contemporary Scientific Research ISSN 2349 – 4425 REFERENCES [1] Ping-Tsai, Sarah, H. Chung and H. Chun-Keung, “A web server design using Search Engine optimization techniques for web intelligence for small organizations, “Systems, Applications and Technology Conference IEEE, ISBN: 978-1-4577-1342-2, 2012.

[2] Neelam Tyagi, International Journal of Advanced Research in Computer Science and Software Engineering 5(1), January - 2015, pp. 1050-1056© 2015, IJARCSSE All Rights Reserved Page | 1056

[3] Meng Cui, Songyun Hu, “Search Engine Optimization Research for Website Promotion, Transport Management College, Dalian Maritime University, Dalian, 116026, China”.

[4] H. Zhou, S. Qin., J. Liu,“ Study on Website Search Engine Optimization,” International Conference on Computer Science and Service System IEEE, 2012.

[5] Ayush Jain, Meenu Dave, “The Role of Backlinks in Search Engine Ranking” International Journal of Advanced Research in Computer Science and Software Engineering Volume 3, Issue 4, April 2013”.

[6] “21 Off-Page SEO Strategies to Build Your Online Reputation”. Online. Available:http://www.seomoz.org/ugc/21offpage-seo- strategies-to-build-your-online-reputatio

[7] K. ur Rehman and M. N. Ahmed Khan “ The Foremost Guidelines for Achieving Higher Ranking in Search Results through Search Engine Optimization” International Journal of Advanced Science and Technology Vol. 52, March, 2013

[8] P. Chahal, M. Singh and S. Kumar “Ranking of Web Documents using Semantic Similarity” Information Systems and Computer Networks (ISCON), 2013 International Conference on ©2013 IEEE, DOI 10.1109/ICISCON.2013.6524191 Page(s): 145 – 150

[9] N. V. Pardakhe, Prof. R. R. Keole “Analysis of Various Web Page Ranking Algorithms in Web Structure Mining” International Journal of Advanced Research in Computer and Communication Engineering Vol.2, Issue 12, December 2013

[10] M. Cui, S. Hu “Serach Engine Optmization Research for Website Promotion” Information Technology, Computer Engineering and Management Sciences (ICM), 2011 International Conference on (Volume:4 ) © 2011 IEEE, DOI 10.1109/ICM.2011.308Page(s):100 – 103

[11] Sergey Brin and Lawrence Page, “The Anatomy of a Large-Scale Hypertextual Web Search Engine”. Online Available: http://infolab.stanford.edu/~backrub/google.html

[12] Christopherd Manning, R. Prabhakar and S. Hinrich, “An Introduction to Information Retrieval”, Cambridge University Press, 2009.

[13] G. Shikha, Y. Sunit and G. Raj Kumar, “Search Engine Evaluation Based on Page Level Keywords”, IEEE,ISBN: 978-1- 4673-4529-3, 2012.

[14] “Search Engine Optimization”. Online. Available: http://en.wikipedia.org/wiki/Search_engine_optimization

[15] J. Yi, L. Zhuying and L. Hongwei, “The Research of Search Engine Based on Semantic Web, “International Symposium on Intelligent Information Technology Application Workshops IEEE,978-0-7695-3505-0, 2008.

[16] K. Junaidah Mohamed and R. Mahathir,” Introduction to Semantic Search Engine, “International Conference on Electrical Engineering and informatics IEEE, ISBN: 978-1-4244-4913-2, 2009.

[17] K. Phyo Thu Thu, W. Htwe Pa Pa, T. KhinNwe Ni,“ Keyword Searching and Browsing System over Relational Databases,” IEEE, ISSN : 978-1-4577-1539-6/11, 2011.

[18] M. Debajyoti , S. Manoj , J. Gajanan , P. Trupti, P. Adarsha, “Experience of Developing a Meta-Semantic Search Engine,“ IEEE, ISSN : 978-0-4799-2235-2/13, 2013.

[19] P. Swati and P. Ajay.,”Search Engine Optimization: A Study,”Research Journal of Computer and Information Technology Sciences, Vol. 1(1), P.P 10-13, February 2013.

[20] S.G. Choudhary, S.R. Kalmegh and Dr. S. N. Deshmukh, “Semantic Search Algorithms based on Page Rank and Ontology: A Review”, 3rd International Conference on Intelligent Computational Systems IEEE, 2013.

[21] S. Robin, K. Ankita and B. Priyanka, “Web Page Indexing through Page Ranking for Effective Semantic Search, “Proceedings of 7th International Conference on Intelligent Systems and Control IEEE, ISBN: 978-1-4673-4603-0, 2012.

7| www.yloop.in AIJ-200 American International Journal of Contemporary Scientific Research ISSN 2349 – 4425

[22] T. Duygu, S. Mohammad Ahmed and B. Yıltan, “An Empirical Evaluation on Semantic Search Performance of Keyword- Based and Semantic Search Engines: Google, Yahoo, Msn and Hakia,” Fourth International Conference on Internet Monitoring and Protection IEEE,ISBN: 978-0-7695-3612-5, 2009.

[23] T. PhyoKhine, Htwe Pa Win and KhinNwe Ni Tun, “Keyword Searching and Browsing System over Relational Databases,”IEEE,ISBN: 978-1-4577-1539-6, 2011.

8| www.yloop.in