Machine Learning Research Group's Internal FAQ
About this FAQ
- Surviving the Graduate Program
- UTCS graduate program overview
- Courses
- Working with Ray
Typically you get started with research by doing a project for a class such
as CS388 (Natural Language Processing). Here is a
list of project suggestions for that class. After an initial success in that class, you
usually continue the research through conference courses for independent study (pass/fail).
You are expected to meet with Ray regularly, about once a week, to discuss problems, ideas,
results, etc. Please contact Ray
directly to make an appointment if you want to work with him.
- Master's thesis
- PhD Program
- PhD proposal
- PhD dissertation
- PhD defense
- Job Search
- Doing Research
- Writing & Publishing papers
- Going to conferences
- Presenting papers
- See /u/ml/slides for slide samples
- Please be sure to schedule a practice talk before you present at a
conference. Here is an ACES seminar room
preference list. You can go to the
ACES seminar facilities web site and request
a reservation.
- Best: ACES 3.408 (largest, seats 16-28)
- Good: ACES 3.116 (seats 14-24; large table)
- Good: ACES 2.404B (like 3.116, but high ceiling)
- Good: ACES 5.116, 6.116 (seats 12-22; smaller table)
- Good: ACES 2.444, 5.444, 6.442 (seats 12-20; smaller room)
- Bad: ACES 3.336, 5.336, 6.336 (2 small displays, no screen)
- Worst: ACES 3.256, 6.256 (seats 6, no projection)
- Presenting posters
- AAAI/SIGART Doctoral Consortium
- The Doctoral Consortium (DC) provides an opportunity for a group of Ph.D. students to discuss and
explore their research interests and career objectives with a panel of established researchers in
AI. The DC is usually held as a workshop before the start of the AAAI or IJCAI
conference. We recommend participating in the DC soon after your proposal has been accepted.
See 2008 Call for Applications
for more information.
- AAAI Student Programs
- The AAAI has a dedicated student program (see the announcement for 2008) that allows students to submit an abstract of work in progress.
- Useful Articles & Links
- ML Group Resources
- ML Group job assignments
- Office space
- Most of the students have offices in CSA and ENS basement labs.
The office assignments are done in a central manner. The CS
Department Space Committee meets and assigns students to
offices once a year, usually during the summer.
- The lunar email list and user group
- You can send email to lunar@cs
if you want to broadcast a message to all members of the group. You don't have to
copy Ray separately, since
an email to the group alias also reaches him. You can always check who's on the
list by running "showalias lunar" on a cs host.
- You should also belong to the user group "lunar" so that you can access
the files maintained by the group. Check whether you belong to the lunar group
by running "groups [username]".
- Please be sure to be on the "ai-students" alias (/u/ai-lab/mail-lists/ai-students),
which deals with more general topics. If you want to be added to the list,
please send an email to risto@cs.
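The group-membership check mentioned above can also be scripted. The following is a minimal sketch, not an official group tool: the function name in_group is made up for this example, and it relies only on the standard id command.

```shell
# Report whether a user belongs to a given Unix group.
# Usage: in_group GROUP [USER]   (USER defaults to the current user)
# "in_group" is a hypothetical helper name, not an existing command.
in_group() {
    group=$1
    user=${2:-$(id -un)}
    # `id -Gn` prints the user's group names separated by spaces.
    for g in $(id -Gn "$user"); do
        [ "$g" = "$group" ] && return 0
    done
    return 1
}
```

For example, running in_group lunar && echo "member" || echo "not a member" reports on your own account.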
- The /u/ml directory
- GUI for IE tagging: See /u/ml/groupcode/ie. The GUI needs Perl/Tk,
so you might have to change the first line of code accordingly
(by adding -I directives).
- Group meeting
- We usually meet bi-weekly, in one of the ACES high-tech seminar rooms.
- Group tutorials
(VNC broadcasting, Weka, Servlets, CVS, Swing HTML support)
Also check out the Technical Tips
- Computing Resources (Software, Datasets and Hardware)
- Machine assignments
- Local disk space
- The /var/local directory on each computer is mapped to its local disk.
By default you usually cannot write data there. However, after you send
an email to gripe@cs, you will be given write permission on the local
disk of your computer. Beware that this local space is not backed up.
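To see whether write access has already been granted, a quick check along these lines may help. This is only a sketch; the function name can_write_dir is invented for illustration.

```shell
# Succeed (exit status 0) if the argument is an existing directory
# that the current user may write to.
# "can_write_dir" is a hypothetical helper name.
can_write_dir() {
    dir=$1
    [ -d "$dir" ] && [ -w "$dir" ]
}
```

For example: can_write_dir /var/local || echo "no write access yet; email gripe@cs".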
- WEKA
- Datasets
- Matlab/Octave
- Matlab is a general tool for doing
numerical computations with matrices and vectors. Read the man page
(type "man matlab") on the cs machines to get started. Octave
is a freely available GNU program designed for numerical computations. Most Matlab
code also runs on Octave.
- Condor: Running parallelized jobs on CS machines
- Computing Clusters (e.g. Mastodon)
- Mastodon cluster machines are available to everyone with a cs account through the
Condor job management system, which also manages jobs on the Scout
cluster and the Linux desktops.
Technical Tips
- Also check out the Group Tutorials
- Emacs changes the group and the permissions on any file that I edit. How do
I prevent this?
Add the following lines to your .emacs file:
(setq
 backup-by-copying nil
 backup-by-copying-when-linked t
 backup-by-copying-when-mismatch t)
- Group CVS directory
The group CVS repository is in /u/ml/groupcode/cvsroot. If
you start projects there, please be sure to change the default group to
lunar by executing 'chgrp -R lunar dir' on every dir you create. All CVS
documentation is located here.
For more information, visit our Mini
CVS tutorial page.
- Policy for software installation in the group directory
There are two things to keep in mind when you put something in the /u/ml
directory.
- Make all the directories in /u/ml/software u+rwX and g+rwX, but
o-rwX. This allows group members to modify the
installations, e.g. by making minor (documented) changes to Makefiles,
gunzipping files unused for a long time, etc. At the same time, it does
not give such access to the general public.
It can be achieved for directory "foo" and all its subdirectories by
using the command:
chmod -R ug+rwX,o-rwX foo
- Have all the directories belong to the group "lunar".
It can be achieved for directory "foo" and all its subdirectories by
using the command:
chgrp -R lunar foo
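The two policy steps above can be combined into one helper. This is a sketch under assumptions: the function name share_with_group is made up, and the optional second argument (defaulting to lunar) is an addition for this example so the helper can be tried with another group.

```shell
# Apply the group-directory policy to a directory tree:
# assign the tree to a group, give owner and group read/write access
# (plus execute on directories and already-executable files),
# and remove all access for everyone else.
# Usage: share_with_group DIR [GROUP]   (GROUP defaults to lunar)
# "share_with_group" is a hypothetical helper name.
share_with_group() {
    dir=$1
    group=${2:-lunar}
    chgrp -R "$group" "$dir" &&
    chmod -R ug+rwX,o-rwX "$dir"
}
```

For example, share_with_group foo has the same effect as running the chmod and chgrp commands above on "foo". You must own the files and be a member of the target group for chgrp to succeed.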
- How do I change my default group to lunar or fix all files
in a directory to a group?
- Checking CS email from the Web
You can check CS email from the web using UT's webmail
facility. This is probably useful from Internet cafes that don't
let you telnet, etc.
- Go to https://webmail.utexas.edu/
- Enter your UT EID and password
- On the new page, under "Custom Login", enter your CS login and password,
and enter "imap.cs.utexas.edu" for Server and "Inbox" for Folder.
- I get the following URL exception error while running weka.gui.explorer.Explorer
in VNC (or from an X-shell running on a home computer). How do I fix it?
goya.cs.utexas.edu 101> java weka.gui.explorer.Explorer
Warning: No configuration property found in
weka/gui/GenericObjectEditor.props
for weka.experiment.InstanceQuery
Warning: Can't find public property editor for property "options"
(class "[Ljava.lang.String;").
Skipping.
Setting numeric precision to: 10
Setting axis colour to: green
Setting background colour to: black
Setting attribute bar colour to: black
Displaying attribute bars
Uncaught error fetching image:
java.lang.NullPointerException
at sun.awt.image.URLImageSource.getConnection(URLImageSource.java:99)
at sun.awt.image.URLImageSource.getDecoder(URLImageSource.java:108)
at sun.awt.image.InputStreamImageSource.doFetch(InputStreamImageSource.java:251)
at sun.awt.image.ImageFetcher.fetchloop(ImageFetcher.java:168)
at sun.awt.image.ImageFetcher.run(ImageFetcher.java:136)
goya.cs.utexas.edu 102> exit
- You might want to change the CLASSPATH. Be sure to include the current
directory, denoted by a period (.), in your classpath, as in: -classpath .:classes.
Click here
for more information on this problem.
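If you set CLASSPATH in your shell startup file, a small helper like the one below can add the current directory only when it is missing. This is an illustrative sketch; classpath_with_dot is a made-up name, and it assumes the Unix ":" separator.

```shell
# Print a classpath that contains the current directory (.),
# prepending it unless it is already listed.
# The environment is not modified; the caller decides what to export.
# "classpath_with_dot" is a hypothetical helper name.
classpath_with_dot() {
    cp=$1
    case ":$cp:" in
        *:.:*) printf '%s\n' "$cp" ;;      # "." already present
        ::)    printf '%s\n' "." ;;        # empty classpath
        *)     printf '%s\n' ".:$cp" ;;    # prepend "."
    esac
}
```

For example: CLASSPATH=$(classpath_with_dot "$CLASSPATH"); export CLASSPATH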
- For those of you who are tired of seeing Java print the annoying "Couldn't
get file lock" exception when running Weka, the workaround is to run Java
with this command-line option:
-Djava.util.prefs.syncInterval=2000000
(e.g. java -Xmx500m -Djava.util.prefs.syncInterval=2000000
weka.gui.experiment.Experimenter)
Here is the
bug description (registration required).
Sun labeled it fixed in 1.4.1, but the problem clearly persists. Hopefully they
will fix it in the next release.
Here is the page
that explains the workaround.
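To avoid retyping the option, the workaround can be wrapped in a shell function. This is a sketch only: the name run_weka and the JAVA_BIN and WEKA_HEAP override variables are inventions for this example, while the two Java options are the ones shown above.

```shell
# Launch a Weka class with the prefs sync-interval workaround applied.
# Usage: run_weka CLASS [ARGS...]
# JAVA_BIN and WEKA_HEAP are hypothetical overrides; by default the
# function runs "java" with the 500m heap from the example above.
run_weka() {
    "${JAVA_BIN:-java}" "-Xmx${WEKA_HEAP:-500m}" \
        -Djava.util.prefs.syncInterval=2000000 "$@"
}
```

For example: run_weka weka.gui.experiment.Experimenter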
- How to get a clean pdf file
Use "dvips -Ppdf" when you convert your .dvi file to a .ps file.
- How to produce a pdf file with embedded fonts that does not contain Type 3 fonts?
Instructions from AAAI
Suppose your document is named paper.tex. Run the following commands:
dvips -Ppdf -G0 -tletter paper
ps2pdf -dPDFSETTINGS=/printer -dCompatibilityLevel=1.4 -dMaxSubsetPct=0 \
-dSubsetFonts=false -dEmbedAllFonts=true -sPAPERSIZE=letter paper.ps
Old Instructions
The following commands should be used:
dvips -f -Ppdf -G0 < file.dvi > file.ps
ps2pdf13 -dPDFSETTINGS=/printer file.ps
To check the fonts of your pdf file, you can do one of the following:
- Open the pdf file with acroread and choose File, Document Properties, Fonts tab. Embedded fonts will be listed as "Embedded." Fonts that are not embedded will not be marked in any way.
- Use the command
pdffonts file.pdf
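The pdffonts check can be automated before a submission. The sketch below assumes the usual pdffonts layout (two header lines followed by one line per font, with the font type such as "Type 1" or "Type 3" in the second column); has_type3_fonts is a made-up name.

```shell
# Read `pdffonts` output on standard input and succeed (exit status 0)
# if any font line mentions the "Type 3" font type.
# "has_type3_fonts" is a hypothetical helper name.
has_type3_fonts() {
    # Skip the two header lines, then look for "Type 3" in the font rows.
    awk 'NR > 2 && /Type 3/ { found = 1 } END { exit !found }'
}
```

For example: pdffonts file.pdf | has_type3_fonts && echo "Type 3 fonts found"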
- How to combine several pdf files into one pdf file?
Suppose you have two pdf files a.pdf and b.pdf and you would like to combine them into combined.pdf. The following command does the trick:
gs -dNOPAUSE -sDEVICE=pdfwrite -sOUTPUTFILE=combined.pdf -dEmbedAllFonts=true \
-dSubsetFonts=false -dBATCH a.pdf b.pdf
The command also forces all fonts to be embedded.
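For more than two input files, the same gs invocation can be wrapped in a function. This is a sketch, not a group-provided script: combine_pdfs and the GS_BIN override are names invented here, and the Ghostscript options are exactly those in the command above.

```shell
# Combine PDF files with Ghostscript, embedding all fonts.
# Usage: combine_pdfs OUTPUT.pdf INPUT.pdf [INPUT.pdf ...]
# GS_BIN is a hypothetical override; by default the function runs "gs".
combine_pdfs() {
    if [ "$#" -lt 2 ]; then
        echo "usage: combine_pdfs OUTPUT.pdf INPUT.pdf [INPUT.pdf ...]" >&2
        return 2
    fi
    out=$1
    shift
    "${GS_BIN:-gs}" -dNOPAUSE -sDEVICE=pdfwrite "-sOUTPUTFILE=$out" \
        -dEmbedAllFonts=true -dSubsetFonts=false -dBATCH "$@"
}
```

For example: combine_pdfs combined.pdf a.pdf b.pdf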
- How to convert a PowerPoint slide (or figures in a pdf file) into an eps file to be embedded in your LaTeX document?
You may find it easier to create figures in PowerPoint, or you may want to reuse existing slides in your document. In either case, follow these steps.
- Save the powerpoint slides into a pdf (print -> save as PDF)
- Open the pdf using gimp
- Select to import the page you want
- Use Tools->Selection Tools to perform any cropping (NOTE: Use Image->Crop to Selection instead of Layer->Crop to Selection)
- Save the cropped figure as an eps file
To use a figure in an existing pdf file, simply follow the same steps starting with opening the file using gimp.
UTCS ML Group
For more information or suggestions, please contact the
FAQ manager
Last modified: December 21, 2010
(Always under construction)