irsug sas
-
Upload
alexandra-stan -
Category
Documents
-
view
218 -
download
0
Transcript of irsug sas
-
8/9/2019 irsug sas
1/28
-
8/9/2019 irsug sas
2/28
-
8/9/2019 irsug sas
3/28
iii
Contents
About Th is Book ....................................................................................... v
Audience ................ .............. ............... .............. ............... ............... .............. ... v
Prerequisites ............... .............. ................ .............. ............... ............... ............ vConventions ............. .............. ................ .............. ............... .............. ................ v
1 About SAS Information Retrieval Studio .............................................1
1.1 What Is SAS Information Retrieval Studio? ........ ................ ............... ....... 1
1.2 Benefits of Using SAS Information Retrieval Studio ......... ............... ........ 2
1.3 How Does SAS Information Retrieval Studio Work? .............. .............. ... 3
1.4 How Does SAS Information Retrieval Studio Fit into the SAS
Product Line? ............... .............. ............... ............... ............... ............... .......... 3
2 SAS Informat ion Retrieval Studio Search Window .............................5
2.1 Your First Look at the User Window for SAS Information
Retrieval Studio .............. ............... .............. ............... ............... ............... ........ 52.2 Viewing Search Returns ............... ............... ................ ............... ................ 6
2.2.1 Displays with or without Labels ............... ............... ............... ......... 6
2.2.2 No Labels Example ............. ................ ............... .............. ................ 6
2.2.3 Labels Example .............. ............... .............. ............... ............... ....... 7
2.2.4 Hierarchical Labels Example .................... ............... ................ ........ 7
2.2.5 No Hierarchical Display Example ...... .............. ............... .............. .. 9
3 Search Syntax ........................................................................................11
3.1 Two Types of Search Syntax ............... ............... ............... ................ ........ 11
3.2 Simple Search .............. ............... .............. ................ .............. ............... ..... 12
3.2.1 Overview of Simple Search ..... ............... ................ .............. ........... 12
3.2.2 Determining Relevancy ................... ............... .............. ................ ... 12
3.3 Advanced Search ............... .............. ............... .............. ................ .............. 12
3.3.1 Overview of Advanced Search .............. ............... ................ ........... 12
3.3.2 Advanced Operators .............. ............... ................ ............... ............. 13
Appendixes ............................................................................ 15
A Recommended Reading ........................................................................17
B Glossary .................................................................................................19
-
8/9/2019 irsug sas
4/28
iv SAS Information Retrieval Studio: Users Guide
Index ...........................................................................................................21
-
8/9/2019 irsug sas
5/28
v
About This Book
Audience
SAS Information Retrieval Studio: Users Guideis designed for persons who
search the indexes built by the indexing server.
SAS Information Retrieval Studio enables you to specify your search terms
using various operators. This documentation focuses on ways to search the
index to retrieve the information that you seek.
Prerequisites
Here are the prerequisites for using the search window for SAS
Information Retrieval Studio:
- SAS Information Retrieval Studio installed on your machine.
- A supported browser installed on your desktop client.
If you have any questions about whether you are ready to use SAS Information
Retrieval Studio, contact your system administrator.
Conventions
This manual uses the following typographical conventions:
Convention Description
TGM_ROOT The root directory where SAS Information Retrieval Studio isinstalled, typically the following:
Windows:C: / Program Fi l es/ SAS I nf or mat i onRet r i eval Studi oUNIX:/ opt / SAS I nf ormat i on Ret r i eval Studi o
. xml Code examples are shown in a fixed-width font.
-
8/9/2019 irsug sas
6/28
vi SAS Information Retrieval Studio: Users Guide
Searchbutton The labels for user interface controls are shown in a bold, sans-
serif font.
www. sas. com The hypertext links are shown in a light blue, fixed-width font,and are underlined.
Convention Description
http://www.sas.com/http://www.sas.com/http://www.sas.com/ -
8/9/2019 irsug sas
7/28
1About SAS Information
Retrieval Studio- What Is SAS Information Retrieval Studio?
- Benefits of Using SAS Information Retrieval Studio
- How Does SAS Information Retrieval Studio Work?
- How Does SAS Information Retrieval Studio Fit into the SAS Product
Line?
1.1 What Is SAS Information Retrieval Studio?In many organizations, diverse information consumers need to quickly access
specific data. In an environment where data, and its types, grow exponentially
there is a need to search and locate information from numerous sources. SAS
Information Retrieval Studio combines several key technologies to provide a
comprehensive solution to search.
Search
You, the user, can search the indexes created in SAS Information Retrieval
Studio using several types of search syntax.
Specify different types of search syntax
There are two types of search:
Simple
Type words into the search field to locate information. You can also
prefix these words with plus (+) or minus (-) signs.
Advanced
Add Boolean, positional, and counting operators to your search.
-
8/9/2019 irsug sas
8/28
2 SAS Information Retrieval Studio: Users Guide
See and search related documents
See a list of the matching documents in the search interface. Depending on
the choices made by the administrator, these documents might also be
categorized, have concepts extracted, or facts. If the administrator made
any of these selections, the documents are related to labels.
Labels enable you to use facetted search to intuitively locate yourinformation.
1.2 Benefits of Using SAS InformationRetrieval Studio
SAS Information Retrieval Studio provides the following benefits:
Empowers business owners by locating data
SAS Information Retrieval Studio includes functionality that is designed
to fit your organizations requirements. Use this program to locate the data
that you seek.
Improves the business value of IT and the corporate data that it manages
SAS Information Retrieval Studio provides you with easy, self-service
access to the information contained in your documents. Use SAS
Information Retrieval Studio to search your data.
Saves money on training and support costs
SAS Information Retrieval Studio is so simple that you can quickly
become self-sufficient, with minimal IT support and no need for extensive
training. Once you start using SAS Information Retrieval Studio you are
no longer dependent on the IT staff.
-
8/9/2019 irsug sas
9/28
SAS Information Retrieval Studio: Users Guide 3
1.3 How Does SAS Information RetrievalStudio Work?
SAS Information Retrieval Studio is an application that anyone can use to
locate documents in an index.
1.4 How Does SAS Information Retrieval StudioFit into the SAS Product Line?
As an integral part of the SAS product line, SAS Information Retrieval Studio
provides a comprehensive search solution. Use this application to see
matching categories, concepts, and facts that are applied from projects running
on SAS Content Categorization Server. If specified by the administrator, you
can see these classifications for your documents as labels.
-
8/9/2019 irsug sas
10/28
4 SAS Information Retrieval Studio: Users Guide
-
8/9/2019 irsug sas
11/28
2SAS Information Retrieval
Studio Search Window- Your First Look at the User Window for SAS Information Retrieval
Studio
- Viewing Search Returns
2.1 Your First Look at the User Window forSAS Information Retrieval Studio
SAS Information Retrieval Studio provides the search box that you use to locate
the information that you are seeking.
Display 2.1 The User Window
Search box
type your query into this field.
Search button
click to locate matching documents.
Clear button
click to erase the query terms that you entered.
-
8/9/2019 irsug sas
12/28
6 SAS Information Retrieval Studio: Users Guide
2.2 Viewing Search Returns
2.2.1 Displays with or without Labels
The search results that are returned to your query are displayed according to
the requirements specified by the administrator of SAS Information RetrievalStudio.
Labels enable you to use facetted search. Facetted search makes it possible for
you to click on a link, or label, and to see a list of matching documents. If you
do not find the precise results that you seek, click another, related label and
continue this process until you intuitively locate the information that you
require.
2.2.2 No Labels Example
If your administrator chose to use no labels, search results are displayed in list
format. To see a matched document, select the blue hyperlink that appears tothe right of the number in the ordered list.
Display 2.1 No Labels
-
8/9/2019 irsug sas
13/28
SAS Information Retrieval Studio: Users Guide 7
2.2.3 Labels Example
See one example of the labels that your search results might return below:
Display 2.2 Labels Example
2.2.4 Hierarchical Labels Example
If your administrator specified a hierarchical ordering of labels, the matching
sections of the taxonomy for categories are displayed. (Concepts and facts do
not have parent-child relationships. They only appear in list format.)
Categories precede concepts in this list format.
-
8/9/2019 irsug sas
14/28
8 SAS Information Retrieval Studio: Users Guide
This hierarchical structure is a matched portion of the same taxonomy that
appears in the Taxonomy pane of the SAS Content Categorization Studio user
interface. You can also see a count of the matching documents for each label.
Display 2.3 Hierarchical Taxonomy of Matched Labels
Click the left mouse button on a hyperlink label to make one of the following
selections:
Require
labels appear in the SAS Information Retrieval Studio search window. The
displayed documents match both the query term and the selected label. If
you specify more than one label, the documents match the query term and
all of the selected labels.
Exclude
one label, preceded by the minus sign (-) appears in the SAS Information
Retrieval Studio search window. The displayed documents match thequery term, but not the selected labels.
-
8/9/2019 irsug sas
15/28
SAS Information Retrieval Studio: Users Guide 9
View
one label appears in the SAS Information Retrieval Studio search window
that displays all of the matching documents for this label, only.
Remove
this operation is available for the label appearing at the top of the SAS
Information Retrieval Studio search window. This selection is availableonly after you use any of the above operations.
2.2.5 No Hierarchical Display Example
If you do not see a hierarchical display, matching categories and concepts are
displayed in list format. Categories precede concepts in this list format. Click
on a link to see the paths, or parent-child relationships that exist in the SAS
Content Categorization Studio taxonomy that they match.
Display 2.4 No Hierarchy
Note: You can also use the Require, Exclude, View, andRemoveoperations. For more information, see Section2.2.4 Hierarchical Labels Exampleon page 7.
-
8/9/2019 irsug sas
16/28
10 SAS Information Retrieval Studio: Users Guide
-
8/9/2019 irsug sas
17/28
3Search Syntax
- Two Types of Search Syntax
- Simple Search
- Advanced Search
3.1 Two Types of Search Syntax
SAS Information Retrieval Studio processes two types of searches:
Simple
use the traditional type of document search. You input a query term suchas SAS, SAS Information Retrieval Studio, or business solutions to locate
the information that you seek.
Advanced
check with your administrator before you qualify your query terms with
Boolean operators. The Boolean search request includes the use of
Boolean operators such as AND, DIST, or NOT. A Boolean query can
match either a standard field or return an exact match on a Boolean field.
Note: These fields are specified by the administrator of SASInformation Retrieval Studio project.
-
8/9/2019 irsug sas
18/28
12 SAS Information Retrieval Studio: Users Guide
3.2 Simple Search
3.2.1 Overview of Simple Search
Simple search is the traditional way to search documents. A query consists of
a list of terms such as business solutions, search and indexing, or queryprotocol. If at least one query term matches within a document field, the
document is considered to be a match for the query. For example, if the word
searchis located in a document, the document is returned as a match.
3.2.2 Determining Relevancy
The relevancy of each matched document to the query is determined by the
number of matching terms. More matches equal a higher relevancy score for
the document. In addition, terms can also be marked as required or excluded.
Each term can be either a single word or a quoted phrase. If the word or phrase
(minus the quotation marks ['' '']), appears anywhere within the specifiedfields, the document is a match.
3.3 Advanced Search
3.3.1 Overview of Advanced Search
Advanced search is a more complex way to search documents and return
precise results. An advanced query consists of a list of a hosted set of
arguments. (In this case, hosted means arguments that are supported by SASInformation Retrieval Studio.) These arguments can be other operators or
query terms. If at least one query term matches within a document field, and
according to the requirements specified by the operators, the document is a
match. For example, if the word searchis located in a document, this
document is returned as a match.
-
8/9/2019 irsug sas
19/28
SAS Information Retrieval Studio: Users Guide 13
3.3.2 Advanced Operators
Advanced search is a precise way to search documents. The advanced search
query syntax consists of terms that are connected by Boolean, nested
operators. The total number of matches returned is an exact, not an estimated,
number.
Note: Boolean operators are case-sensitive.
Table 3-1: Common Boolean Operators
Operator Description
AND Takes two or more arguments. True only if all of the arguments are true.
OR Takes two or more arguments. True if at least one argument is true.
NOT Takes one argument. True if present under ANDand the argument is false.
Note: The NOToperator is used in the context of an AND, and not an ORoperator.
MI N_n Takes one or more arguments. True if at least narguments are true.
MI NOC_n Takes one or more arguments. True if the total number of occurrences ofthe arguments in the document is at least n.
DI ST_n Takes two or more arguments. True if all of the arguments occur withinthe specified number of words.
NOTDI ST Takes two or more arguments. True if all of the arguments occur in thesame document, but not within the specified number of words.
ORD Takes two or more arguments. True if all of the arguments occur in thesame order specified by the rule.
ORDDI ST Takes two or more arguments. True if all of the arguments occur in thesame order specified by the rule, and within the specified number of
words from each other.
-
8/9/2019 irsug sas
20/28
14 SAS Information Retrieval Studio: Users Guide
START True if all the arguments occur within the specified number of words fromthe start of the document.
END True if all of its arguments appear within the specified number of wordsfrom the end of the document.
NOTI N True if the first argument matches, and if this match is not part of thespecified larger string in the second argument.
Table 3-1: Common Boolean Operators (Continued)
-
8/9/2019 irsug sas
21/28
15
Appendixes
- Appendix A:Recommended Reading on page 17
- Appendix B: Glossary on page 19
-
8/9/2019 irsug sas
22/28
16 SAS Information Retrieval Studio: Users Guide
-
8/9/2019 irsug sas
23/28
17
Appendix: A
Recommended Reading
The following books are recommended as companion guides:- SAS Information Retrieval Studio: Installation Guide: Install SAS
Information Retrieval Studio and prerequisite software.
- SAS Information Retrieval Studio: Administrators Guide: Use this
book to select the components of your custom information retrieval
pipeline. Configure these components that include the search
window that end users use to query an index.
- SAS offers instructor-led training and self-paced e-learning courses to
help you get started with the SAS add-in, learn how the SAS add-in
works with the other products in the SAS Enterprise Intelligence
Platform, and learn how to run stored processes in the SAS add-in.
For more information about the courses available, seesuppor t . sas. com/ t r ai ni ng.
For a complete list of SAS publications, see the current SAS Publishing
Catalog. To order the most current publications or to receive a free copy of the
catalog, contact a SAS representative at
SAS Publishing Sales
SAS Campus Drive
Cary, NC 27513
Telephone: (800) 727-3228*
Fax: (919) 677-8166
E-mail: [email protected]
Web address:suppor t . sas . com/ pubs
* For other SAS Institute business, call (919) 677-8000.
Customers outside the United States should contact their local SAS office.
http://www.support.sas.com/traininghttp://www.support.sas.com/pubshttp://www.support.sas.com/pubshttp://www.support.sas.com/training -
8/9/2019 irsug sas
24/28
18 SAS Information Retrieval Studio: Users Guide
-
8/9/2019 irsug sas
25/28
19
Appendix: BGlossary
Boolean operators
specifies words such as AND, OR, and NOT, to construct logical definitions
that locate the matches that you seek.
categorization
concisely defines the subject matter of a document, in other words, the
main idea or subject of the document.
concept
specifies any of the followinga string, token, or an argumentto locate
in an input document. A concept locates the metadata of the input
document.
fact
links two, or more, concepts to provide otherwise overlooked relationshipsin input documents.
string
refers to a group of words or characters that you specify.
-
8/9/2019 irsug sas
26/28
-
8/9/2019 irsug sas
27/28
21
Index
A
AND operator ...........................................................................................................13
D
DIST_n operator .......................................................................................................13
E
END operator ............................................................................................................14
L
labels
hierarchy ..............................................................................................................7
M
MIN_n operator ........................................................................................................13
MINOC_n operator ...................................................................................................13
N
no hierarchysearch returns .......................................................................................................9
no labels
search returns .......................................................................................................6
NOT operator .............. ................ ............... .............. ................ .............. ............... .... 13
NOTDIST operator .............. ............... .............. ............... .............. ............... ............ 13
NOTIN operator .............. ............... ................ .............. ............... .............. ............... 14
-
8/9/2019 irsug sas
28/28
22 SAS Information Retrieval Studio: Users Guide
O
OR operator .............................................................................................................. 13
ORD operator ........................................................................................................... 13
S
START operator ....................................................................................................... 14