More information
Questions: Whats in the package? | What do I need to run ICECUP? | What can ICECUP do? | Will ICECUP continue to be developed? | Feedback
The ICE-GB R2 Sample Corpus is available for download NOW.
It comes complete with 10 texts selected by Gerry Nelson from the ICE-GB Corpus, and the state-of-the-art ICECUP 3.1.1 software written by Sean Wallis.
The Sample Corpus comes in two flavours:
- The Text sampler equivalent to the old 'complete' sampler.
- The Text+Audio sampler.
WHAT IS IN THE SAMPLE CORPUS PACKAGE?
- Ten texts (over 20,000 words), fully parsed and annotated, exactly as they are in ICE-GB.
- The latest release of ICECUP 3.1. This is a full working version of the software (see below) complete with help.
- Example Fuzzy Tree Fragments.
- Option: Audio for the 5 spoken texts.
The sample contains the following ten texts, shown in the last column. You can view these texts and their classification when you download and install the software. The complete ICE structure is visible from ICECUPs Corpus Map.
Spoken Texts (300) | Dialogues (180) | Private (100) | face-to-face conversations (90) phonecalls (10) |
S1A-010 S1A-094 |
Public (80) | classroom lessons (20) broadcast discussions (20) broadcast interviews (10) parliamentary debates (10) legal cross-examinations (10) business transactions (10) |
|||
Monologues (100) | Unscripted (70) | spontaneous commentaries (20) unscripted speeches (30) demonstrations (10) legal presentations (10) |
S2A-011 | |
Scripted (30) | broadcast talks (20) non-broadcast speeches (10) |
S2B-026 | ||
Mixed (20) | broadcast news (20) |
S2B-002 | ||
Written Texts (200) | Non-printed (50) | Non-professional writing (20) | untimed student essays (10) student examination scripts (10) |
W1A-001 |
Correspondence (30) | social letters (15) business letters (15) |
W1B-001 | ||
Printed (150) | Academic writing (40) | humanities (10) social sciences (10) natural sciences (10) technology (10) |
W2A-005 | |
Non-academic writing (40) | humanities (10) social sciences (10) natural sciences (10) technology (10) |
|||
Reportage (20) | press news reports (20) | W2C-009 | ||
Instructional writing (20) | administrative / regulatory (10) skills / hobbies (10) |
W2D-018 | ||
Persuasive writing (10) | press editorials (10) | |||
Creative writing (20) | novels / stories (20) |
WHAT IS NOT INCLUDED?
Release 2 of ICE-GB is supplied on CD-ROM. ICE-GB contains five hundred texts of spoken and written contemporary British English. To obtain the other 490 texts, you must order the CD-ROM! If you want to do this, click here.
WHAT DO I NEED TO RUN ICECUP 3.1?
The latest version of ICECUP, ICECUP 3.1.1 runs on 32 bit and 64 bit Windows, from Windows XP to Windows 10. You can upgrade to the most recent version for free.
Older versions of ICECUP run on 32 bit Windows, from 3.1 upwards.
Sampler system requirements There are two install packages, with hard disk capacity requirements as follows:
We have tested the software extensively on platforms that we have access to, and we have witnessed it working OK on others. The most up-to-date list is shown below. System requirements for the ICE-GB corpus (CD-ROM) These differ from the above only in terms of hard disk space. You need 96Mb to install the entire corpus. Note that you can also run searches off the CD without installing anything. The software is identical to that supplied with the sample corpus. You can therefore try before you buy. If in doubt, install the sample corpus and software before ordering the CD. |
WILL ICECUP CONTINUE TO BE DEVELOPED?
Yes. ICECUP 3.1.1 is simply the latest release of the software. We have developed and maintained ICECUP for two decades.
COMMENTS, SUGGESTIONS, QUERIES...
The Resources section of this website has sections on Fuzzy Tree Fragments and an explanation of carrying out experiments with parsed corpora.
If you want to ask a question of the author you can email Sean Wallis directly.
This page last modified 14 May, 2020 by Survey Web Administrator.