irsug sas

download irsug sas

of 28

Transcript of irsug sas

  • 8/9/2019 irsug sas

    1/28

  • 8/9/2019 irsug sas

    2/28

  • 8/9/2019 irsug sas

    3/28

    iii

    Contents

    About Th is Book ....................................................................................... v

    Audience ................ .............. ............... .............. ............... ............... .............. ... v

    Prerequisites ............... .............. ................ .............. ............... ............... ............ vConventions ............. .............. ................ .............. ............... .............. ................ v

    1 About SAS Information Retrieval Studio .............................................1

    1.1 What Is SAS Information Retrieval Studio? ........ ................ ............... ....... 1

    1.2 Benefits of Using SAS Information Retrieval Studio ......... ............... ........ 2

    1.3 How Does SAS Information Retrieval Studio Work? .............. .............. ... 3

    1.4 How Does SAS Information Retrieval Studio Fit into the SAS

    Product Line? ............... .............. ............... ............... ............... ............... .......... 3

    2 SAS Informat ion Retrieval Studio Search Window .............................5

    2.1 Your First Look at the User Window for SAS Information

    Retrieval Studio .............. ............... .............. ............... ............... ............... ........ 52.2 Viewing Search Returns ............... ............... ................ ............... ................ 6

    2.2.1 Displays with or without Labels ............... ............... ............... ......... 6

    2.2.2 No Labels Example ............. ................ ............... .............. ................ 6

    2.2.3 Labels Example .............. ............... .............. ............... ............... ....... 7

    2.2.4 Hierarchical Labels Example .................... ............... ................ ........ 7

    2.2.5 No Hierarchical Display Example ...... .............. ............... .............. .. 9

    3 Search Syntax ........................................................................................11

    3.1 Two Types of Search Syntax ............... ............... ............... ................ ........ 11

    3.2 Simple Search .............. ............... .............. ................ .............. ............... ..... 12

    3.2.1 Overview of Simple Search ..... ............... ................ .............. ........... 12

    3.2.2 Determining Relevancy ................... ............... .............. ................ ... 12

    3.3 Advanced Search ............... .............. ............... .............. ................ .............. 12

    3.3.1 Overview of Advanced Search .............. ............... ................ ........... 12

    3.3.2 Advanced Operators .............. ............... ................ ............... ............. 13

    Appendixes ............................................................................ 15

    A Recommended Reading ........................................................................17

    B Glossary .................................................................................................19

  • 8/9/2019 irsug sas

    4/28

    iv SAS Information Retrieval Studio: Users Guide

    Index ...........................................................................................................21

  • 8/9/2019 irsug sas

    5/28

    v

    About This Book

    Audience

    SAS Information Retrieval Studio: Users Guideis designed for persons who

    search the indexes built by the indexing server.

    SAS Information Retrieval Studio enables you to specify your search terms

    using various operators. This documentation focuses on ways to search the

    index to retrieve the information that you seek.

    Prerequisites

    Here are the prerequisites for using the search window for SAS

    Information Retrieval Studio:

    - SAS Information Retrieval Studio installed on your machine.

    - A supported browser installed on your desktop client.

    If you have any questions about whether you are ready to use SAS Information

    Retrieval Studio, contact your system administrator.

    Conventions

    This manual uses the following typographical conventions:

    Convention Description

    TGM_ROOT The root directory where SAS Information Retrieval Studio isinstalled, typically the following:

    Windows:C: / Program Fi l es/ SAS I nf or mat i onRet r i eval Studi oUNIX:/ opt / SAS I nf ormat i on Ret r i eval Studi o

    . xml Code examples are shown in a fixed-width font.

  • 8/9/2019 irsug sas

    6/28

    vi SAS Information Retrieval Studio: Users Guide

    Searchbutton The labels for user interface controls are shown in a bold, sans-

    serif font.

    www. sas. com The hypertext links are shown in a light blue, fixed-width font,and are underlined.

    Convention Description

    http://www.sas.com/http://www.sas.com/http://www.sas.com/
  • 8/9/2019 irsug sas

    7/28

    1About SAS Information

    Retrieval Studio- What Is SAS Information Retrieval Studio?

    - Benefits of Using SAS Information Retrieval Studio

    - How Does SAS Information Retrieval Studio Work?

    - How Does SAS Information Retrieval Studio Fit into the SAS Product

    Line?

    1.1 What Is SAS Information Retrieval Studio?In many organizations, diverse information consumers need to quickly access

    specific data. In an environment where data, and its types, grow exponentially

    there is a need to search and locate information from numerous sources. SAS

    Information Retrieval Studio combines several key technologies to provide a

    comprehensive solution to search.

    Search

    You, the user, can search the indexes created in SAS Information Retrieval

    Studio using several types of search syntax.

    Specify different types of search syntax

    There are two types of search:

    Simple

    Type words into the search field to locate information. You can also

    prefix these words with plus (+) or minus (-) signs.

    Advanced

    Add Boolean, positional, and counting operators to your search.

  • 8/9/2019 irsug sas

    8/28

    2 SAS Information Retrieval Studio: Users Guide

    See and search related documents

    See a list of the matching documents in the search interface. Depending on

    the choices made by the administrator, these documents might also be

    categorized, have concepts extracted, or facts. If the administrator made

    any of these selections, the documents are related to labels.

    Labels enable you to use facetted search to intuitively locate yourinformation.

    1.2 Benefits of Using SAS InformationRetrieval Studio

    SAS Information Retrieval Studio provides the following benefits:

    Empowers business owners by locating data

    SAS Information Retrieval Studio includes functionality that is designed

    to fit your organizations requirements. Use this program to locate the data

    that you seek.

    Improves the business value of IT and the corporate data that it manages

    SAS Information Retrieval Studio provides you with easy, self-service

    access to the information contained in your documents. Use SAS

    Information Retrieval Studio to search your data.

    Saves money on training and support costs

    SAS Information Retrieval Studio is so simple that you can quickly

    become self-sufficient, with minimal IT support and no need for extensive

    training. Once you start using SAS Information Retrieval Studio you are

    no longer dependent on the IT staff.

  • 8/9/2019 irsug sas

    9/28

    SAS Information Retrieval Studio: Users Guide 3

    1.3 How Does SAS Information RetrievalStudio Work?

    SAS Information Retrieval Studio is an application that anyone can use to

    locate documents in an index.

    1.4 How Does SAS Information Retrieval StudioFit into the SAS Product Line?

    As an integral part of the SAS product line, SAS Information Retrieval Studio

    provides a comprehensive search solution. Use this application to see

    matching categories, concepts, and facts that are applied from projects running

    on SAS Content Categorization Server. If specified by the administrator, you

    can see these classifications for your documents as labels.

  • 8/9/2019 irsug sas

    10/28

    4 SAS Information Retrieval Studio: Users Guide

  • 8/9/2019 irsug sas

    11/28

    2SAS Information Retrieval

    Studio Search Window- Your First Look at the User Window for SAS Information Retrieval

    Studio

    - Viewing Search Returns

    2.1 Your First Look at the User Window forSAS Information Retrieval Studio

    SAS Information Retrieval Studio provides the search box that you use to locate

    the information that you are seeking.

    Display 2.1 The User Window

    Search box

    type your query into this field.

    Search button

    click to locate matching documents.

    Clear button

    click to erase the query terms that you entered.

  • 8/9/2019 irsug sas

    12/28

    6 SAS Information Retrieval Studio: Users Guide

    2.2 Viewing Search Returns

    2.2.1 Displays with or without Labels

    The search results that are returned to your query are displayed according to

    the requirements specified by the administrator of SAS Information RetrievalStudio.

    Labels enable you to use facetted search. Facetted search makes it possible for

    you to click on a link, or label, and to see a list of matching documents. If you

    do not find the precise results that you seek, click another, related label and

    continue this process until you intuitively locate the information that you

    require.

    2.2.2 No Labels Example

    If your administrator chose to use no labels, search results are displayed in list

    format. To see a matched document, select the blue hyperlink that appears tothe right of the number in the ordered list.

    Display 2.1 No Labels

  • 8/9/2019 irsug sas

    13/28

    SAS Information Retrieval Studio: Users Guide 7

    2.2.3 Labels Example

    See one example of the labels that your search results might return below:

    Display 2.2 Labels Example

    2.2.4 Hierarchical Labels Example

    If your administrator specified a hierarchical ordering of labels, the matching

    sections of the taxonomy for categories are displayed. (Concepts and facts do

    not have parent-child relationships. They only appear in list format.)

    Categories precede concepts in this list format.

  • 8/9/2019 irsug sas

    14/28

    8 SAS Information Retrieval Studio: Users Guide

    This hierarchical structure is a matched portion of the same taxonomy that

    appears in the Taxonomy pane of the SAS Content Categorization Studio user

    interface. You can also see a count of the matching documents for each label.

    Display 2.3 Hierarchical Taxonomy of Matched Labels

    Click the left mouse button on a hyperlink label to make one of the following

    selections:

    Require

    labels appear in the SAS Information Retrieval Studio search window. The

    displayed documents match both the query term and the selected label. If

    you specify more than one label, the documents match the query term and

    all of the selected labels.

    Exclude

    one label, preceded by the minus sign (-) appears in the SAS Information

    Retrieval Studio search window. The displayed documents match thequery term, but not the selected labels.

  • 8/9/2019 irsug sas

    15/28

    SAS Information Retrieval Studio: Users Guide 9

    View

    one label appears in the SAS Information Retrieval Studio search window

    that displays all of the matching documents for this label, only.

    Remove

    this operation is available for the label appearing at the top of the SAS

    Information Retrieval Studio search window. This selection is availableonly after you use any of the above operations.

    2.2.5 No Hierarchical Display Example

    If you do not see a hierarchical display, matching categories and concepts are

    displayed in list format. Categories precede concepts in this list format. Click

    on a link to see the paths, or parent-child relationships that exist in the SAS

    Content Categorization Studio taxonomy that they match.

    Display 2.4 No Hierarchy

    Note: You can also use the Require, Exclude, View, andRemoveoperations. For more information, see Section2.2.4 Hierarchical Labels Exampleon page 7.

  • 8/9/2019 irsug sas

    16/28

    10 SAS Information Retrieval Studio: Users Guide

  • 8/9/2019 irsug sas

    17/28

    3Search Syntax

    - Two Types of Search Syntax

    - Simple Search

    - Advanced Search

    3.1 Two Types of Search Syntax

    SAS Information Retrieval Studio processes two types of searches:

    Simple

    use the traditional type of document search. You input a query term suchas SAS, SAS Information Retrieval Studio, or business solutions to locate

    the information that you seek.

    Advanced

    check with your administrator before you qualify your query terms with

    Boolean operators. The Boolean search request includes the use of

    Boolean operators such as AND, DIST, or NOT. A Boolean query can

    match either a standard field or return an exact match on a Boolean field.

    Note: These fields are specified by the administrator of SASInformation Retrieval Studio project.

  • 8/9/2019 irsug sas

    18/28

    12 SAS Information Retrieval Studio: Users Guide

    3.2 Simple Search

    3.2.1 Overview of Simple Search

    Simple search is the traditional way to search documents. A query consists of

    a list of terms such as business solutions, search and indexing, or queryprotocol. If at least one query term matches within a document field, the

    document is considered to be a match for the query. For example, if the word

    searchis located in a document, the document is returned as a match.

    3.2.2 Determining Relevancy

    The relevancy of each matched document to the query is determined by the

    number of matching terms. More matches equal a higher relevancy score for

    the document. In addition, terms can also be marked as required or excluded.

    Each term can be either a single word or a quoted phrase. If the word or phrase

    (minus the quotation marks ['' '']), appears anywhere within the specifiedfields, the document is a match.

    3.3 Advanced Search

    3.3.1 Overview of Advanced Search

    Advanced search is a more complex way to search documents and return

    precise results. An advanced query consists of a list of a hosted set of

    arguments. (In this case, hosted means arguments that are supported by SASInformation Retrieval Studio.) These arguments can be other operators or

    query terms. If at least one query term matches within a document field, and

    according to the requirements specified by the operators, the document is a

    match. For example, if the word searchis located in a document, this

    document is returned as a match.

  • 8/9/2019 irsug sas

    19/28

    SAS Information Retrieval Studio: Users Guide 13

    3.3.2 Advanced Operators

    Advanced search is a precise way to search documents. The advanced search

    query syntax consists of terms that are connected by Boolean, nested

    operators. The total number of matches returned is an exact, not an estimated,

    number.

    Note: Boolean operators are case-sensitive.

    Table 3-1: Common Boolean Operators

    Operator Description

    AND Takes two or more arguments. True only if all of the arguments are true.

    OR Takes two or more arguments. True if at least one argument is true.

    NOT Takes one argument. True if present under ANDand the argument is false.

    Note: The NOToperator is used in the context of an AND, and not an ORoperator.

    MI N_n Takes one or more arguments. True if at least narguments are true.

    MI NOC_n Takes one or more arguments. True if the total number of occurrences ofthe arguments in the document is at least n.

    DI ST_n Takes two or more arguments. True if all of the arguments occur withinthe specified number of words.

    NOTDI ST Takes two or more arguments. True if all of the arguments occur in thesame document, but not within the specified number of words.

    ORD Takes two or more arguments. True if all of the arguments occur in thesame order specified by the rule.

    ORDDI ST Takes two or more arguments. True if all of the arguments occur in thesame order specified by the rule, and within the specified number of

    words from each other.

  • 8/9/2019 irsug sas

    20/28

    14 SAS Information Retrieval Studio: Users Guide

    START True if all the arguments occur within the specified number of words fromthe start of the document.

    END True if all of its arguments appear within the specified number of wordsfrom the end of the document.

    NOTI N True if the first argument matches, and if this match is not part of thespecified larger string in the second argument.

    Table 3-1: Common Boolean Operators (Continued)

  • 8/9/2019 irsug sas

    21/28

    15

    Appendixes

    - Appendix A:Recommended Reading on page 17

    - Appendix B: Glossary on page 19

  • 8/9/2019 irsug sas

    22/28

    16 SAS Information Retrieval Studio: Users Guide

  • 8/9/2019 irsug sas

    23/28

    17

    Appendix: A

    Recommended Reading

    The following books are recommended as companion guides:- SAS Information Retrieval Studio: Installation Guide: Install SAS

    Information Retrieval Studio and prerequisite software.

    - SAS Information Retrieval Studio: Administrators Guide: Use this

    book to select the components of your custom information retrieval

    pipeline. Configure these components that include the search

    window that end users use to query an index.

    - SAS offers instructor-led training and self-paced e-learning courses to

    help you get started with the SAS add-in, learn how the SAS add-in

    works with the other products in the SAS Enterprise Intelligence

    Platform, and learn how to run stored processes in the SAS add-in.

    For more information about the courses available, seesuppor t . sas. com/ t r ai ni ng.

    For a complete list of SAS publications, see the current SAS Publishing

    Catalog. To order the most current publications or to receive a free copy of the

    catalog, contact a SAS representative at

    SAS Publishing Sales

    SAS Campus Drive

    Cary, NC 27513

    Telephone: (800) 727-3228*

    Fax: (919) 677-8166

    E-mail: [email protected]

    Web address:suppor t . sas . com/ pubs

    * For other SAS Institute business, call (919) 677-8000.

    Customers outside the United States should contact their local SAS office.

    http://www.support.sas.com/traininghttp://www.support.sas.com/pubshttp://www.support.sas.com/pubshttp://www.support.sas.com/training
  • 8/9/2019 irsug sas

    24/28

    18 SAS Information Retrieval Studio: Users Guide

  • 8/9/2019 irsug sas

    25/28

    19

    Appendix: BGlossary

    Boolean operators

    specifies words such as AND, OR, and NOT, to construct logical definitions

    that locate the matches that you seek.

    categorization

    concisely defines the subject matter of a document, in other words, the

    main idea or subject of the document.

    concept

    specifies any of the followinga string, token, or an argumentto locate

    in an input document. A concept locates the metadata of the input

    document.

    fact

    links two, or more, concepts to provide otherwise overlooked relationshipsin input documents.

    string

    refers to a group of words or characters that you specify.

  • 8/9/2019 irsug sas

    26/28

  • 8/9/2019 irsug sas

    27/28

    21

    Index

    A

    AND operator ...........................................................................................................13

    D

    DIST_n operator .......................................................................................................13

    E

    END operator ............................................................................................................14

    L

    labels

    hierarchy ..............................................................................................................7

    M

    MIN_n operator ........................................................................................................13

    MINOC_n operator ...................................................................................................13

    N

    no hierarchysearch returns .......................................................................................................9

    no labels

    search returns .......................................................................................................6

    NOT operator .............. ................ ............... .............. ................ .............. ............... .... 13

    NOTDIST operator .............. ............... .............. ............... .............. ............... ............ 13

    NOTIN operator .............. ............... ................ .............. ............... .............. ............... 14

  • 8/9/2019 irsug sas

    28/28

    22 SAS Information Retrieval Studio: Users Guide

    O

    OR operator .............................................................................................................. 13

    ORD operator ........................................................................................................... 13

    S

    START operator ....................................................................................................... 14