Difference between revisions of "Private:progress-khodabakhshi"

From NMSL
 
(35 intermediate revisions by the same user not shown)
Line 1: Line 1:
 +
= Summer 2012 (RA) =
 +
 +
===May===
 +
* Working on my [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/khodabakhshi/thesis/ thesis].
 +
* Addressing TOMCCAP paper comments.
 +
* Reviewing two MM12 short papers.
 +
* Running new experiments to cover change of frame rate, as asked by TOMCCAP reviewer.
 +
* Implementing GUI for two [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-projects/3DVideo/3DVCopyDetection/code/java/trunk/CSpider/ Cspider] applications.
 +
 +
= Spring 2012 (RA) =
 +
* Courses:
 +
**CMPT828 Illumination in Images and Video
 +
**CMPT894 Cloud Computing Directed Reading
 +
 +
===April===
 +
* Course project of CMPT828.
 +
* Implementing video registration and copy detection application.
 +
* Preparing the spider demo [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-projects/3DVideo/3DVCopyDetection/documents/mm12-3dvcd/doc/mm12.pdf doc].
 +
* Working on Cloud Computing [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/khodabakhshi/reports/CloudComputing/doc/CloudComputing.pdf report].
 +
 +
===March 30===
 +
* Working on VCD companies and TRECVID [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-projects/3DVideo/3DVCopyDetection/documents/VCDCompanies-Survey/VCDCompanies-TRECVID.pdf survey].
 +
 +
* Reviewing the assigned TOMCCAP journal.
 +
 +
* Working on CSpider [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-projects/3DVideo/3DVCopyDetection/documents/Report-CSpiderDesign/doc/reportTemplate.pdf design].
 +
 +
* Implementing some parts of createSignatureLib of CSpider.
 +
 +
===Feb 29===
 +
* Working on the TOMCCAP [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-projects/3DVideo/3DVCopyDetection/documents/tomccap12/ draft]
 +
 +
* Working on my talk for the 828 course.
 +
 +
* Preparing for MMSys presentation.
 +
 +
* Attended the MMSys conference.
 +
 +
* Working on VCD companies survey.
 +
 +
===Jan 27===
 +
* Working on the TOMCCAP [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-projects/3DVideo/3DVCopyDetection/documents/tomccap12/ draft]
 +
 +
* Exploring a new visual signature which details is explained in my [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/khodabakhshi/reports/Report_1-1-12 report]
 +
 +
* Reading "Introduction to Cloud Computing" book by Timothy Chou.
 +
* Reading "Cloud Application Architectures: Building Applications and Infrastructure in the Cloud" book by George Reese.
 +
 +
= Fall 2011 =
 +
* Internship
 +
 +
= Summer 2011 (RA) =
 +
* Courses:
 +
**CMPT826 (dropped)
 +
 +
===June 14===
 +
* New results added to [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/khodabakhshi/reports/3-1-11/reportTemplate.pdf report]
 +
 +
===May 31===
 +
* Implementation part added to [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/khodabakhshi/reports/3-1-11/reportTemplate.pdf report].
 +
* Normalization of the score changed and the plot of the new normalized score are updated in the report.
 +
 +
===May 17===
 +
* Reading this paper "Model-Based Joint Bit Allocation Between Texture videos and Depth Maps for 3D Video Coding". Motivation: to see how distorted a synthesized view would be if its texture and depth are distorted to some degree.
 +
 +
* Working on the normalization of the scores, and defining a threshold. So that if the score between two videos are greater than this threshold, they are considered a copy of each other. After defining this threshold, precision and recall of the system can be determined.
 +
 +
:* Probably I will try a sigmoid function to normalize the scores between [0,1]
 +
:* and boost the score of the best matching video, if the first best matching video has a much higher relevance score than the second best matching video. x1 = x1 * (x1/x2)
 +
 +
* What should be done next:
 +
:* Right now the system uses all the frames of the videos, next step would be to use boundary detection algorithm to extract keyframes.
 +
:* Right now I evaluated the performance of the system against view interpolation attack by using actual videos taken from different viewpoints by cameras. Next step would be to synthesize views and the use the synthesized ones and determine their distortion and use them for evaluation.
 +
:* Right now the implementation and evaluation are considering 3D videos consist of one video + its depth. This can be extended to Multiview + Depth. If so, the depth extraction is need.
 +
 
= Spring 2011 (RA) =
 
= Spring 2011 (RA) =
 
* Courses:
 
* Courses:
 
**CMPT 771: Internet Architecture and Protocols
 
**CMPT 771: Internet Architecture and Protocols
 +
 +
===May 9===
 +
* Evaluation part of my [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/khodabakhshi/reports/3-1-11/reportTemplate.pdf report] has been updated
 +
 +
===April 26===
 +
* Implementing: Combining SIFT and Depth results to make a better decision.
 +
* Next step would be to evaluate it. Different transformations must be applied to query videos and robustness of the system against them should be measured.
 +
 +
===April 18===
 +
* Course work (final exam)
 +
* The "D. Depth map creation for multiview video" section of my [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/khodabakhshi/reports/3-1-11/reportTemplate.pdf report] is updated.
 +
 +
===April 8===
 +
* Course work (problem set + presentation)
 +
* Exploring depth extraction details
 +
===March 29===
 +
* Implementing the depth signature part of the system. The updated [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/khodabakhshi/reports/3-1-11/reportTemplate.pdf report] is here.
 +
 +
===March 8===
 +
* Exploring multiview geometry and depth extraction methods. My [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/khodabakhshi/reports/3-1-11/reportTemplate.pdf report] is here.
  
 
===March 1===
 
===March 1===
 
* Continuing implementation
 
* Continuing implementation
 +
I am implementing the proposed algorithm in my [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/khodabakhshi/talks/3DVideoCopyDetection.pptx talk] using the mentioned libraries. The implementation up to and including frame level matching phase is completed till now.
  
 
===Feb 22===
 
===Feb 22===
 
* Continuing implementation
 
* Continuing implementation
There are some reliable implementation of SIFT algorithm.  
+
There are some reliable implementation of SIFT algorithm. The first one is by author of the SIFT, David Lowe. However, his implementation is not open source, and he just distributed a binary file. In other words, this implementation is not flexible and it's not possible to change its parameters, which is necessary in our task, since we have to change its parameters to reduce the number of SIFT features by accepting the most informative ones.  The implementation that I decided to use is [http://www.vlfeat.org/ one] by Andrea Vedaldi, which is in C language, and has Matlab interface as well. The Open-Source SIFT Library won first place at the [http://www.acmmm10.org/2010/10/open-source-software-competition-winners/ ACM Multimedia 2010 Open-Source Software Competition] (the competition was stiff). It is flexible enough for our task. The two parameters that we need to change are the peak threshold, and the edge threshold, which can be change using this library.
 +
 
 +
For the approximate nearest neighbor task, I have decided to use open source [http://www.cs.ubc.ca/~mariusm/index.php/FLANN/FLANN FLANN] library, which is one of the fastest nearest neighbor algorithms, as I have explained in more details in my [https://cs-nsl-svn.cs.surrey.sfu.ca/cssvn/nsl-members/khodabakhshi/reports/3-1-11/reportTemplate.pdf report]. This library is in C++ language, and also have Matlab interface as well.
 +
 
 +
I have learned to use these two libraries using their Matlab interface. The reason is that this task needs some kind of matrix manipulation, and Matlab is efficient in this case.
  
 
===Feb 15===
 
===Feb 15===

Latest revision as of 14:18, 23 May 2012

Summer 2012 (RA)

May

  • Working on my thesis.
  • Addressing TOMCCAP paper comments.
  • Reviewing two MM12 short papers.
  • Running new experiments to cover change of frame rate, as asked by TOMCCAP reviewer.
  • Implementing GUI for two Cspider applications.

Spring 2012 (RA)

  • Courses:
    • CMPT828 Illumination in Images and Video
    • CMPT894 Cloud Computing Directed Reading

April

  • Course project of CMPT828.
  • Implementing video registration and copy detection application.
  • Preparing the spider demo doc.
  • Working on Cloud Computing report.

March 30

  • Working on VCD companies and TRECVID survey.
  • Reviewing the assigned TOMCCAP journal.
  • Implementing some parts of createSignatureLib of CSpider.

Feb 29

  • Working on the TOMCCAP draft
  • Working on my talk for the 828 course.
  • Preparing for MMSys presentation.
  • Attended the MMSys conference.
  • Working on VCD companies survey.

Jan 27

  • Working on the TOMCCAP draft
  • Exploring a new visual signature which details is explained in my report
  • Reading "Introduction to Cloud Computing" book by Timothy Chou.
  • Reading "Cloud Application Architectures: Building Applications and Infrastructure in the Cloud" book by George Reese.

Fall 2011

  • Internship

Summer 2011 (RA)

  • Courses:
    • CMPT826 (dropped)

June 14

May 31

  • Implementation part added to report.
  • Normalization of the score changed and the plot of the new normalized score are updated in the report.

May 17

  • Reading this paper "Model-Based Joint Bit Allocation Between Texture videos and Depth Maps for 3D Video Coding". Motivation: to see how distorted a synthesized view would be if its texture and depth are distorted to some degree.
  • Working on the normalization of the scores, and defining a threshold. So that if the score between two videos are greater than this threshold, they are considered a copy of each other. After defining this threshold, precision and recall of the system can be determined.
  • Probably I will try a sigmoid function to normalize the scores between [0,1]
  • and boost the score of the best matching video, if the first best matching video has a much higher relevance score than the second best matching video. x1 = x1 * (x1/x2)
  • What should be done next:
  • Right now the system uses all the frames of the videos, next step would be to use boundary detection algorithm to extract keyframes.
  • Right now I evaluated the performance of the system against view interpolation attack by using actual videos taken from different viewpoints by cameras. Next step would be to synthesize views and the use the synthesized ones and determine their distortion and use them for evaluation.
  • Right now the implementation and evaluation are considering 3D videos consist of one video + its depth. This can be extended to Multiview + Depth. If so, the depth extraction is need.

Spring 2011 (RA)

  • Courses:
    • CMPT 771: Internet Architecture and Protocols

May 9

  • Evaluation part of my report has been updated

April 26

  • Implementing: Combining SIFT and Depth results to make a better decision.
  • Next step would be to evaluate it. Different transformations must be applied to query videos and robustness of the system against them should be measured.

April 18

  • Course work (final exam)
  • The "D. Depth map creation for multiview video" section of my report is updated.

April 8

  • Course work (problem set + presentation)
  • Exploring depth extraction details

March 29

  • Implementing the depth signature part of the system. The updated report is here.

March 8

  • Exploring multiview geometry and depth extraction methods. My report is here.

March 1

  • Continuing implementation

I am implementing the proposed algorithm in my talk using the mentioned libraries. The implementation up to and including frame level matching phase is completed till now.

Feb 22

  • Continuing implementation

There are some reliable implementation of SIFT algorithm. The first one is by author of the SIFT, David Lowe. However, his implementation is not open source, and he just distributed a binary file. In other words, this implementation is not flexible and it's not possible to change its parameters, which is necessary in our task, since we have to change its parameters to reduce the number of SIFT features by accepting the most informative ones. The implementation that I decided to use is one by Andrea Vedaldi, which is in C language, and has Matlab interface as well. The Open-Source SIFT Library won first place at the ACM Multimedia 2010 Open-Source Software Competition (the competition was stiff). It is flexible enough for our task. The two parameters that we need to change are the peak threshold, and the edge threshold, which can be change using this library.

For the approximate nearest neighbor task, I have decided to use open source FLANN library, which is one of the fastest nearest neighbor algorithms, as I have explained in more details in my report. This library is in C++ language, and also have Matlab interface as well.

I have learned to use these two libraries using their Matlab interface. The reason is that this task needs some kind of matrix manipulation, and Matlab is efficient in this case.

Feb 15

  • Starting implementation

Feb 8

  • Exploring different nearest neighbor techniques in high dimensional spaces

Feb 1

  • Exploring depth maps features

Jan 24

  • Adapting 2D CBVCD to 3D
  • Preparing the group meeting talk: Slides.

Jan 17

  • Exploring image registration methods
  • Exploring feature-based image registration methods
  • Exploring feature-based image registration methods that use SIFT-based methods
  • A more detailed report is available at: report but it is still changing...

Jan 10

  • Exploring 3D videos.
  • Exploring literature for 3D video copy detection

Jan 3

  • having fun, not really working :)

Fall 2010 (TA)

  • Courses:
    • CMPT 701: Design/Analysis Algorithms
    • CMPT 820: Multimedia Systems


  • Worked on 2D video copy detection.