www.webdeveloper.com

Thread: Search Engine using Perl / CGI --- help

  1. #1
    Join Date: Feb 2005; Location: Frisco, Texas; Posts: 18

    I am very new to Perl, but my job requires a simple search engine for our intranet site, which runs on an Apache server. I found a script in O'Reilly's CGI Programming with Perl book. After some modifications (with help) it runs without errors: I can enter a keyword, and it runs a search and displays results.

    PROBLEM #1: Unless a file is in the top-level directory, the search won't find it. I need it to search all the subdirectories and the folders within them.

    PROBLEM #2: On the results page, the hyperlinks don't work. An example of an <a href> it creates is http://company.com/cgi-bin/VIRTUAL_PATH/pagename.html. My guess would be that VIRTUAL_PATH is causing the problem, but I have no real idea.

    Any help is appreciated...many thanks.

    Here is the HTML file:

    <HTML>
    <HEAD>
    <TITLE>Simple 'Mindless' Search</TITLE>
    </HEAD>
    <BODY>
    <H1>Are you ready to search?</H1>
    <P>
    <FORM ACTION="/cgi-bin/grep_search2.cgi" METHOD="GET">
    <INPUT TYPE="text" NAME="query" SIZE="20">
    <INPUT TYPE="submit" VALUE="GO!">
    </FORM>
    </BODY>
    </HTML>


    Here is the Perl script:


    #!/usr/local/bin/perl -w

    use strict;
    use CGI;
    #use CGIBook::Error;
    use CGI::Carp 'fatalsToBrowser';

    my $DOCUMENT_ROOT = $ENV{DOCUMENT_ROOT};
    my $VIRTUAL_PATH = "";
    my $q = new CGI;
    my $query = $q->param( "query" );

    #if ( defined $query and length $query ) {
    #    die "Please specify a valid query!";
    #}

    $query = quotemeta ( $query );
    my $results = search ( $q, $query );

    print $q->header( "text/html" ),
        $q->start_html( "Simple Perl Search" ),
        $q->h1( "Search for: $query" ),
        $q->ul ( $results || "No matches found" ),
        $q->end_html;

    sub search {
        my ( $q, $query ) = @_;
        my ( %matches, @files, @sorted_paths, $results );

        local( *DIR, *FILE );

        opendir DIR, $DOCUMENT_ROOT or
            error ( $q, "Cannot access search dir!" );

        @files = grep { -T "$DOCUMENT_ROOT/$_" } readdir DIR;
        closedir DIR;

        my $file;
        foreach my $file ( @files ) {
            my $full_path = "$DOCUMENT_ROOT/$file";
            open FILE, $full_path or
                error ( $q, "Cannot process $file!" );

            while ( <FILE> ) {
                if (/$query/io ) {
                    $_ = html_escape( $_ );
                    s| ($query) |<B>$1</B>|gio;
                    push @{ $matches{$full_path}{content} }, $_;
                    $matches{$full_path}{file} = $file;
                    $matches{$full_path}{num_matches}++;
                }
            }
            close FILE;
        }

        @sorted_paths = sort {
            $matches{$b}{num_matches} <=>
            $matches{$a}{num_matches} ||
            $a cmp $b
        } keys %matches;

        my $full_path;
        foreach my $full_path ( @sorted_paths ) {
            my $file = $matches{$full_path}{file};
            my $num_matches = $matches{$full_path}{num_matches};
            my $link = $q->a( { -href => "VIRTUAL_PATH/$file" }, $file );
            my $content = join $q->br, @{ $matches{$full_path}{content} };

            $results .= $q->p( $q->b( $link ) . " ($num_matches matches)" .
                $q->br . $content
            );
        }

        return $results;
    }

    sub html_escape {
        my ( $text ) = @_;

        $text =~ s/&/&amp;/g;
        $text =~ s/</&lt;/g;
        $text =~ s/>/&gt;/g;
        return $text;
    }

  2. #2
    Join Date: Mar 2004; Posts: 282
    Try the Simple Search script from the nms project (written by London.pm, the London Perl Mongers): http://nms-cgi.sourceforge.net/scripts.shtml
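
Both problems in the first post seem to trace back to specific lines of the posted script. For Problem #1, `readdir` lists only the one directory it was opened on, so nothing below `$DOCUMENT_ROOT` is ever examined. For Problem #2, the href is built from the string `"VIRTUAL_PATH/$file"`; without a leading `$`, Perl emits the literal text VIRTUAL_PATH, and the browser resolves that relative link against /cgi-bin/. What follows is a minimal sketch of one way to address both, assuming the core File::Find module is acceptable on the server. The helper names `find_text_files` and `link_for` and the sample file names are made up for illustration; they are not from the book or the original script, and the sketch assumes file names that need no URL escaping.

```perl
#!/usr/bin/perl
use strict;
use warnings;
use File::Find;
use File::Temp qw(tempdir);
use File::Path qw(make_path);

# Hypothetical helper: return every text file under $root, at any depth,
# as paths relative to $root (for example "docs/howto/setup.html").
sub find_text_files {
    my ( $root ) = @_;
    $root =~ s{/+\z}{};    # tolerate a trailing slash on the root
    my @files;

    find(
        {
            no_chdir => 1,    # do not chdir, so full paths can be tested directly
            wanted   => sub {
                my $path = $File::Find::name;
                return unless -f $path && -T $path;    # plain text files only
                ( my $rel = $path ) =~ s{^\Q$root\E/?}{};
                push @files, $rel;
            },
        },
        $root
    );

    my @sorted = sort @files;
    return @sorted;
}

# Hypothetical helper: build a root-relative URL for a file found above.
# Note the "$" sigil on $virtual_path; the posted script has the bare word
# VIRTUAL_PATH inside the string at this point.
sub link_for {
    my ( $virtual_path, $rel ) = @_;
    return "$virtual_path/$rel";
}

# Demo on a throwaway directory tree (assumed layout, for illustration only).
my $root = tempdir( CLEANUP => 1 );
make_path( "$root/docs/howto" );
for my $rel ( "index.html", "docs/faq.html", "docs/howto/setup.html" ) {
    open my $fh, '>', "$root/$rel" or die "Cannot write $rel: $!";
    print {$fh} "<p>search me</p>\n";
    close $fh;
}

print link_for( "", $_ ), "\n" for find_text_files( $root );
```

In the posted script, something like `@files = find_text_files( $DOCUMENT_ROOT );` could stand in for the `opendir`/`readdir`/`closedir` lines; the existing `"$DOCUMENT_ROOT/$file"` path should still work because each entry is relative to the document root. The href would then become `"$VIRTUAL_PATH/$file"`. With `$VIRTUAL_PATH` left empty, that yields a link starting with `/`, which the browser resolves from the site root rather than from /cgi-bin/. Separately, with `use CGIBook::Error` commented out, the `error(...)` calls have no subroutine to call; `die` may be a simpler substitute, since `CGI::Carp 'fatalsToBrowser'` is already loaded.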
