Lecture 8: CGI (Common Gateway Interface ) Outline CGI Overview

Total Page:16

File Type:pdf, Size:1020Kb

Lecture 8: CGI (Common Gateway Interface ) Outline CGI Overview Outline CGI Overview Between Client and Handler Lecture 8: CGI Between Web Server and Handler (Common Gateway Interface ) Wendy Liu CSC309F – Fall 2007 1 2 Common Gateway Interface (CGI) CGI Overview http://www.oreilly.com/openbook/cgi/ Access to dynamic server-side content Services Data 3 4 1 Components of the CGI Model CGI Interaction (1) Clients Web browsers Web server Mediates communication between browsers and handler programs CGI Protocol Specifies interaction between Browser and handler programs Function call syntax Web server and handler programs Handler programs (or CGI scripts) Client sends request with a URL+additional info Any executables residing on the web server Web server receives the request Can be written in any language Web server identifies the request as a CGI request C, C++, Perl, VB, Python, SmallTalk, Assembly, Lisp, etc. Web server locates the handler program 5 6 CGI Interaction (2) CGI Interaction (3) Web server starts up the handling program Handler program executes Heavy weight process creation Output of the handler is sent to the Web server via stdout Output is typically a web page Web server feeds request parameters to handler Web server returns output to the requesting web browser Through stdin or environment variables 7 8 2 Handling CGI Process CGI Example in Perl #!/local/bin/perl print "Content-type: text/html\n\n"; Main Process Request for print "<html>\n"; Child Process for CGI1 CGI1 print "<body>\n"; Request for print "<h1>hello</h1>\n"; Child Process for CGI2 CGI2 print "</body>\n"; Request for print "</html>\n"; Child Process for CGI1 CGI1 In Web page CGI-based Web Server <a href=“cgi-bin/hello.pl”> Click to run CGI </a> Cause the execution of the Perl script hello.pl 9 10 Client - Handler Interaction Clients request the execution of a handler program by specifying A request method (e.g., GET, POST) Between Client and Handler Universal Resource Locator (URL) Locates the handler Indirect interaction Additional arguments Query string 11 12 3 Query String Encoding HTTP Methods Query string is used to transmit data associated with GET a GET/POST request Query string attached to the URL after ‘?’ All arguments are packed into a single string As part of the URL Each argument is represented by a name-value pair as URL may have a length restriction on the server side name=value Arguments are separated by ampersand, & Limited amount of information can be passed name_1=value_1&name_2=value_2 Arguments appear in server logs Spaces in a name or value are replaced by a plus POST sign, + Query string encoded in the HTTP request body URLs cannot have spaces in them Not part of the URL Other characters (=, &, +) are replaced by a percent sign (%) followed by a two-digit hexadecimal value Arbitrarily long form data can be communicated Arguments usually do not appear in server logs 13 14 Forms Example: GET Forms Example: POST On the Web page On the Web page <form action=“cgi-bin/login.pl” method=“get”> <form action=“cgi-bin/login.pl” method=“post”> <input type=“text” name=“id” /> <input type=“text” name=“id” /> <input type=“text” name=“password”/> <input type=“text” name=“password”/> </form> </form> Request sent to web server Request sent to web server Assuming user enters: ‘abc’ and ‘123’ Assuming user enters: ‘abc’ and ‘123’ POST cgi-bin/login.pl HTTP/1.1 GET cgi-bin/login.pl?id=abc&password=123 <headers …> HTTP/1.1 id=abc&password=123 15 16 4 Web Server - Handler Interaction Information about a request comes from Request line Header line Between Web Server and Handler Request-body (POST) Handler input Environment variables stdin (request body) Handler output stdout 17 18 Environment Variables Example –Perl Script AUTH_TYPE REMOTE_IDENT #!/local/bin/perl print "Content-type: text/plain\n\n"; CONTENT_LENGTH REMOTE_USER print "Environment\n"; CONTENT_TYPE REQUEST_METHOD @keys = keys %ENV; # variable names GATEWAY_INTERFACE @values = values %ENV; # variable values SCRIPT_NAME while (@keys) { PATH_INFO SERVER_NAME print pop(@keys), '=', pop(@values), "\n"; PATH_TRANSLATED } SERVER_PORT QUERY_STRING $request_method = $ENV{'REQUEST_METHOD'}; SERVER_PROTOCOL if ($request_method eq "POST") { REMOTE_ADDR $formsize = $ENV{'CONTENT_LENGTH'}; REMOTE_HOST SERVER_SOFTWARE read (STDIN, $form_info, $formsize); print "\nSTDIN\n" print $form_info; } 19 20 5 Example – Web Page Example – GET Request <body> HTTP/1.1 200 OK <h3>GET</h3> Date: Mon, 01 Oct 2007 23:17:56 GMT <form action="cgi-bin/stdin.pl" method="get"><p> Server: Apache/1.3.29 (Unix) Student Number: <input type="text" name="studentNum"/> <br /> Last-Modified: Wed, 12 Sep 2007 20:22:21 GMT Id: <input type="text" name="id"/> <br /> Accept-Ranges: bytes Password: <input type="password" name="password"/> <br /> Return Files: <input type="checkbox" name="returnFiles"/> <br /> Content-Length: 65 <input type="submit" name="submit" /></p> Keep-Alive: timeout=15, max=100 </form> Connection: Keep-Alive Content-Type: text/plain <h3>POST</h3> <form action="cgi-bin/stdin.pl" method="post"><p> Environment Student Number: <input type="text" name="studentNum"/> <br /> SERVER_PORT=80 Id: <input type="text" name="id"/> <br /> Password: <input type="password" name="password"/> <br /> REQUEST_METHOD=GET Return Files: <input type="checkbox" name="returnFiles"/> <br /> … <input type="submit" name="submit" /><p> QUERY_STRING=studentNum=a&id=b&password=234&returnFiles=on&submit </form> =Submit </body> … 21 22 Example – POST Request Important Announcement HTTP/1.1 200 OK Date: Mon, 01 Oct 2007 23:17:56 GMT Tuesday Oct 16, 2007 Server: Apache/1.3.29 (Unix) Last-Modified: Wed, 12 Sep 2007 20:32:21 GMT 2-hr tutorial given in a CDF lab, BA3185 Accept-Ranges: bytes Content-Length: 65 Thursday Oct 18, 2007 Keep-Alive: timeout=15, max=100 Connection: Keep-Alive Midterm, BA2195 Content-Type: text/plain Starting Tuesday Oct 23, 2007 Environment SERVER_PORT=80 The official lecture room will be BA2185 on each REQUEST_METHOD=POST … Tuesday until the end of term QUERY_STRING= No change to Thursdays’ classes (BA1240) … STDIN studentNum=a&id=b&password=234&returnFiles=on&submit=Submit 23 24 6.
Recommended publications
  • The Common Gateway Interface and Server-Side Programming
    WebWeb MasterMaster 11 IFIIFI Andrea G. B. Tettamanzi Université de Nice Sophia Antipolis Département Informatique [email protected] Andrea G. B. Tettamanzi, 2019 1 Unit 3 The Common Gateway Interface and Server-side Programming Andrea G. B. Tettamanzi, 2019 2 Agenda • The Common Gateway Interface • Server-Side Programming Andrea G. B. Tettamanzi, 2019 3 Introduction • An HTTP server is often used as a gateway to a different information system (legacy or not), for example – an existing body of documents – an existing database application • The Common Gateway Interface (CGI) is an agreement between HTTP server implementors about how to integrate such gateway scripts and programs • It was typically (but not exclusively) used in conjunction with HTML forms to build database applications • Nowadays largely superseded by dynamic Web content technologies such as PHP, ASP.NET, Java Servlets, and Node.js Andrea G. B. Tettamanzi, 2019 4 The Common Gateway Interface • The Common Gateway Interface (CGI) is a de facto standard protocol for Web servers to execute an external program that generates a Web page dynamically • The external program executes like a console application running on the same machine as the Web server (the host) • Such program is known as a CGI script or simply as a CGI Andrea G. B. Tettamanzi, 2019 5 How Does That Work? • Each time a client requests the URL corresponding to a CGI program, the server will execute it in real-time – E.g.: GET http://www.example.org/cgi-bin/add?x=2&y=2 • The output of the program will go more or less directly to the client • Strictly speaking, the “input” to the program is the HTTP request • Environment variables are used to pass data about the request from the server to the program – They are accessed by the script in a system-defined manner – Missing environment variable = NULL value – Character encoding is system-defined Andrea G.
    [Show full text]
  • A Few RPO Exploitation Techniques
    MBSD Technical Whitepaper A few RPO exploitation techniques Takeshi Terada / Mitsui Bussan Secure Directions, Inc. June 2015 Table of Contents 1. Introduction .......................................................................................................... 1 2. Path manipulation techniques ............................................................................. 2 2.1. Loading another file on IIS/ASP.NET ......................................................................... 2 2.2. Loading another file on Safari/Firefox ....................................................................... 3 2.3. Loading another file on WebLogic/IE ......................................................................... 4 2.4. Loading file with query string on WebLogic+Apache ................................................ 5 2.5. Attack possibility in other environments ................................................................... 5 3. Forcing IE’s CSS expression via CV .................................................................... 7 4. Non-stylesheet RPO attacks ................................................................................ 9 5. A path handling bug in CakePHP ...................................................................... 11 6. Conclusion .......................................................................................................... 13 7. References........................................................................................................... 14 8. Test environments .............................................................................................
    [Show full text]
  • Internet Engineering Task Force (IETF) M. Nottingham Request for Comments: 7320 BCP: 190 July 2014 Updates: 3986 Category: Best Current Practice ISSN: 2070-1721
    Internet Engineering Task Force (IETF) M. Nottingham Request for Comments: 7320 BCP: 190 July 2014 Updates: 3986 Category: Best Current Practice ISSN: 2070-1721 URI Design and Ownership Abstract Section 1.1.1 of RFC 3986 defines URI syntax as "a federated and extensible naming system wherein each scheme's specification may further restrict the syntax and semantics of identifiers using that scheme." In other words, the structure of a URI is defined by its scheme. While it is common for schemes to further delegate their substructure to the URI's owner, publishing independent standards that mandate particular forms of URI substructure is inappropriate, because that essentially usurps ownership. This document further describes this problematic practice and provides some acceptable alternatives for use in standards. Status of This Memo This memo documents an Internet Best Current Practice. This document is a product of the Internet Engineering Task Force (IETF). It represents the consensus of the IETF community. It has received public review and has been approved for publication by the Internet Engineering Steering Group (IESG). Further information on BCPs is available in Section 2 of RFC 5741. Information about the current status of this document, any errata, and how to provide feedback on it may be obtained at http://www.rfc-editor.org/info/rfc7320. Nottingham Best Current Practice [Page 1] RFC 7320 URI Design Ownership July 2014 Copyright Notice Copyright (c) 2014 IETF Trust and the persons identified as the document authors. All rights reserved. This document is subject to BCP 78 and the IETF Trust's Legal Provisions Relating to IETF Documents (http://trustee.ietf.org/license-info) in effect on the date of publication of this document.
    [Show full text]
  • API Technical Document
    Case API Service Technical Documentation Date: December 19, 2017 Case API Service – Technical Documentation 1 Expected Knowledge .............................................................................................................. 3 2 Getting Started ........................................................................................................................ 3 3 More Technical Information ................................................................................................... 3 3.1 Request reference ............................................................................................................. 3 3.1.1 Overview ................................................................................................................... 3 3.1.2 REST style interface ................................................................................................. 3 3.1.3 Requests .................................................................................................................... 4 3.1.4 Authentication ........................................................................................................... 4 3.1.5 Encryption ................................................................................................................. 4 3.1.6 Resources .................................................................................................................. 4 3.1.7 Request Parameters ..................................................................................................
    [Show full text]
  • MCSD Programming in HTML5 with Javascript and CSS3
    Microsoft 70-480 MCSD Programming in HTML5 with JavaScript and CSS3 Microsoft 70-480 Dumps Available Here at: https://www.certification-questions.com/microsoft-exam/70-480-dumps.html Enrolling now you will get access to 317 questions in a unique set of 70- 480 dumps Question 1 You are developing a customer contact form that will be displayed on a page of a company's website. The page collects information about the customer. If a customer enters a value before submitting the form, it must be a valid email address. You need to ensure that the data validation requirement is met. What should you use? Options: A. <input name="email" type="url"/> B. <input name="email" type="text" required="required"/> C. <<input name="email" type="text"/> D. <input name="email" type="email"/> Answer: D Explanation: The <input type="email"> is used for input fields that should contain an e-mail address. Depending on browser support, the e-mail address can be automatically validated when submitted. Some smartphones recognize the email type, and adds ".com" to the keyboard to match email input. Example: <form> E-mail: <input type="email" name="email"/> </form> Reference: http://www.w3schools.com/html/html5_form_input_types.asp Question 2 https://www.certification-questions.com Microsoft 70-480 DRAG DROP You are developing a form that captures a user's email address by using HTML5 and jQuery. The form must capture the email address and return it as a query string parameter. The query string parameter must display the @ symbol that is used in the email address. You need to implement this functionality.
    [Show full text]
  • Yarl Documentation Release 1.6.3
    yarl Documentation Release 1.6.3- Andrew Svetlov Sep 15, 2021 CONTENTS 1 Introduction 3 2 Installation 5 3 Dependencies 7 4 API documentation 9 5 Comparison with other URL libraries 11 6 Why isn’t boolean supported by the URL query API? 13 7 Source code 15 8 Discussion list 17 9 Authors and License 19 9.1 Public API................................................ 19 10 Indices and tables 31 Python Module Index 33 Index 35 i ii yarl Documentation, Release 1.6.3- The module provides handy URL class for URL parsing and changing. CONTENTS 1 yarl Documentation, Release 1.6.3- 2 CONTENTS CHAPTER ONE INTRODUCTION URL is constructed from str: >>> from yarl import URL >>> url= URL( 'https://www.python.org/~guido?arg=1#frag') >>> url URL('https://www.python.org/~guido?arg=1#frag') All URL parts: scheme, user, password, host, port, path, query and fragment are accessible by properties: >>> url.scheme 'https' >>> url.host 'www.python.org' >>> url.path '/~guido' >>> url.query_string 'arg=1' >>> url.query <MultiDictProxy('arg': '1')> >>> url.fragment 'frag' All URL manipulations produces a new URL object: >>> url.parent/ 'downloads/source' URL('https://www.python.org/downloads/source') A URL object can be modified with / and % operators: >>> url= URL( 'https://www.python.org') >>> url/ 'foo' / 'bar' URL('https://www.python.org/foo/bar') >>> url/ 'foo' %{ 'bar': 'baz'} URL('https://www.python.org/foo?bar=baz') Strings passed to constructor and modification methods are automatically encoded giving canonical representation as result: >>> url= URL( 'https://www.python.org/') >>> url URL('https://www.python.org/%D0%BF%D1%83%D1%82%D1%8C') 3 yarl Documentation, Release 1.6.3- Regular properties are percent-decoded, use raw_ versions for getting encoded strings: >>> url.path '/' >>> url.raw_path '/%D0%BF%D1%83%D1%82%D1%8C' Human readable representation of URL is available as human_repr(): >>> url.human_repr() 'https://www.python.org/' For full documentation please read Public API section.
    [Show full text]
  • Context Request Cookies Add
    Context Request Cookies Add Dunc is filthiest: she air-condition implicatively and incensing her Lindisfarne. Is Anurag ingrate or loricate when refurnishes some jigs wash-outs anticipatorily? If invincible or befouled Finn usually disarrays his godmothers hypersensitised indistinguishably or bellylaugh exchangeably and unrecognisable, how transformed is Nathanil? In the server is to add context cookies are separated by accessing restricted content Other iterables yielding byte strings are not joined because money may produce too big and fit improve memory. You propose also subclass the Response class to implement his own functionality. For those using web forms and vb. HTTP GET more an idempotent method that returns a resource from below given URL. If there is a parsley root associated with the flaw, the best root network is included within the traversal path. This dispute be used to gracefully shutdown connections that have undergone ALPN protocol upgrade or bishop have been hijacked. Useful avoid the new medicine is a special case what an existing tag. These virtual attributes return unicode strings, even if there value goes missing or unicode decoding fails. Do have post message bit in the dom has loaded. Register create custom template filter. For each handled request, the tracer shows the swirl of rules and actions that were carried out, open the changes made green the fist request at each step across the sequence. Response occasion the coach Request. The actual WSGI application. National Academy of Sciences. You perhaps not logged in. Click will be handle exceptions and twirl them some error messages and the function will never visit but roll down your interpreter.
    [Show full text]
  • HTTP Protocol Binding
    ONEM2M TECHNICAL SPECIFICATION Document Number TS-0009-V2.6.1 Document Name: HTTP Protocol Binding Date: 2016-August-30 Abstract: HTTP Protocol Binding TS This Specification is provided for future development work within oneM2M only. The Partners accept no liability for any use of this Specification. The present document has not been subject to any approval process by the oneM2M Partners Type 1. Published oneM2M specifications and reports for implementation should be obtained via the oneM2M Partners' Publications Offices. © oneM2M Partners Type 1 (ARIB, ATIS, CCSA, ETSI, TIA, TSDSI, TTA, TTC) Page 1 of 20 This is a draft oneM2M document and should not be relied upon; the final version, if any, will be made available by oneM2M Partners Type 1. About oneM2M The purpose and goal of oneM2M is to develop technical specifications which address the need for a common M2M Service Layer that can be readily embedded within various hardware and software, and relied upon to connect the myriad of devices in the field with M2M application servers worldwide. More information about oneM2M may be found at: http//www.oneM2M.org Copyright Notification No part of this document may be reproduced, in an electronic retrieval system or otherwise, except as authorized by written permission. The copyright and the foregoing restriction extend to reproduction in all media. © 2016, oneM2M Partners Type 1 (ARIB, ATIS, CCSA, ETSI, TIA, TSDSI, TTA, TTC). All rights reserved. Notice of Disclaimer & Limitation of Liability The information provided in this document is directed solely to professionals who have the appropriate degree of experience to understand and interpret its contents in accordance with generally accepted engineering or other professional standards and applicable regulations.
    [Show full text]
  • A Securityfocused HTTP Primer Basics Query Strings, Parameters
    2/19/2016 A Security­focused HTTP Primer Home Blog Study Essays Projects Services Podcast About A Security­focused HTTP Primer Home » Study » A Security­focused HTTP Primer What follows is a primer on the key security­oriented characteristics of the HTTP protocol. It’s a collection of a number of different sub­topics, explained in my own way, for the purpose of having a single reference point when needed. Basics Query Strings, Parameters, and Ampersands URL Encoding Authentication HTTP Requests Request Methods HTTP Responses Status / Response Codes HTTP Headers Proxies Cookies Basics Message­based You make a request, you get a response. Line­based Lines are quite significant in HTTP. Each header is on an individual line (each line ends with a <crlf>), and a blank line separates the header section from the optional body section. Stateless HTTP doesn’t have the concept of state built­in, which is why things like cookies are used to track users within and across sessions. Query Strings, Parameters, and Ampersands Query Strings (?) A query string is defined by using the question mark (?) character after the URL being requested, and it defines what is being sent to the web application for processing. They are typically used to pass the contents of HTML forms, and are encoded using name:value pairs. http://google.com/search?query=mysearch https://danielmiessler.com/study/http/ 1/9 2/19/2016 A Security­focused HTTP Primer Parameters (something=something) In the request above the parameter is the “query” value–presumably indicating it’s what’s being searched for.
    [Show full text]
  • Http Request and Response
    Http Request And Response Patrice undermine mushily. Hierarchic and dipetalous Kevin dichotomise, but Sivert landward festinated her beseechers. Eric is transparently Afro-Asian after splenial Welbie denoted his purlines swinishly. The Server will deploy the pill body. The hot body, forwards the request create an inbound server, an get on the cache can reveal information long dream a user believes that the information has been removed from possible network. If and https. HTTP Fields Elastic Common Schema ECS Reference 17. Every HTTP interaction includes a folder and a harness By my nature HTTP is stateless Stateless means keep all requests are arise from. Warnings are http request are different summary with? HTTP Request Headers and CloudFront Behavior Custom. This class now performs all their necessary certificate and hostname checks by default. An HTTP response except what web clients often browsers receive it an Internet server in overall to an HTTP request These responses communicate valuable. May be stale responses from and https whenever their mime type header applies rules and encryption. URL of the server. Date is and response for. Tls and response message id is requested document if request for credentials sent to access and will be conservative behavior and larry masinter for a thread. HTTP verb was used for self request. Http listener servlet reads out css etc, it and an html document against threats to apply adjustments to use this loop controller will only one. POST request, them all that possible matches in the sampler data are processed. Container environment security for enter stage of drain life cycle.
    [Show full text]
  • REST Just Got Easy with SAS® and PROC HTTP Joseph Henry, SAS Institute Inc
    Paper SAS4426-2020 REST Just Got Easy with SAS® and PROC HTTP Joseph Henry, SAS Institute Inc. ABSTRACT Hypertext Transfer Protocol (HTTP) is the lifeblood of the web. It is used every time you upload a photo, refresh a web page, and in most every modern application you use today from your phone to your TV. A big reason why HTTP has become so ubiquitous with modern technology is because of a software architectural style known as Representational State Transfer (REST). REST has become the standard for interacting with independent systems across the web, which means that the consumption of REST APIs is, or is going to be, mandatory. The HTTP procedure in SAS® has enabled developers to use REST APIs simply and securely for many years now, but the demands of current systems keep increasing. PROC HTTP has been given quite a few updates to make using this great procedure even easier than it ever was before. INTRODUCTION The HTTP procedure has been a staple in SAS for many years for communicating with the web and Representational State Transfer (REST) APIs. Over the past few years, PROC HTTP has seen many updates that continue to increase its functionality and usability. SAS® Viya 3.5 continues this trend by introducing some new syntax to the procedure that makes many common tasks simpler as well as more accessible to newer SAS developers who might be used to other HTTP frameworks. The enhancements addressed in this paper are also available in SAS®9 as part of the second release of SAS 9.4M6.
    [Show full text]
  • 10.1 the Common Gateway Interface 10.2 CGI Linkage 10.2 CGI Linkage (Continued)
    10.1 The Common Gateway Interface 10.2 CGI Linkage 10.2 CGI Linkage (continued) - Markup languages cannot be used to specify - CGI programs often are stored in a directory named <!-- reply.html - calls a trivial cgi program cgi-bin computations, interactions with users, or to --> provide access to databases <html> - Some CGI programs are in machine code, but Perl <head> programs are usually kept in source form, so <title> - CGI is a common way to provide for these needs, perl must be run on them by allowing browsers to request the execution of HTML to call the CGI-Perl program reply.pl </title> server-resident software - A source file can be made to be “executable” by </head> adding a line at their beginning that specifies that <body> - CGI is just an interface between browsers and a language processing program be run on them This is our first CGI-Perl example servers first <a href = "http://www.cs.ucp.edu/cgi-bin/reply.pl"> - An HTTP request to run a CGI program specifies a For Perl programs, if the perl system is stored in Click here to run the CGI program, reply.pl </a> program, rather than a document /usr/local/bin/perl, as is often is in UNIX </body> systems, this is - Servers can recognize such requests in two ways: </html> #!/usr/local/bin/perl -w 1. By the location of the requested file (special - The connection from a CGI program back to the subdirectories for such files) - An HTML document specifies a CGI program with requesting browser is through standard output, the hypertext reference attribute, href, of an anchor usually through the server 2.
    [Show full text]