thr3ads.net - llvm dev - [llvm-dev] [GSoC16] Seeking Guidance for a project regarding SAFECode [Mar 2016]

If this information is useful, please help other people find it:
Share via:

Abhinav Tripathi via llvm-dev

2016-Mar-03 17:25 UTC

[llvm-dev] [GSoC16] Seeking Guidance for a project regarding SAFECode

Hello,
I am Abhinav Tripathi, B.Tech 3rd Year student from IIT Indore, India. I
was looking on the projects ideas page of llvm and saw that I could also
propose to work on the SAFECode Open projects. As I found no mailing list
on their site, I am sending this message here. Please redirect me to some
other list, if required.
.
I found most of the projects quite alluring as I have been working on a
similar project (in terms of tasks) since the last GSoC. It is CPPSharp (
https://github.com/genuinelucifer/CppSharp).
.
The ones that I found most interesting were:
1 - Improve Static Array Bounds Checking -- Because I have done a lot of
array related tasks while writing marshalling code for CppSharp. I think I
can really contribute into this project.
.
2 - Create a simpler CompleteChecks pass -- Although I admit, I didn't
quite understand what simpler would mean here. But it seems fairly
challenging (if it's regarding optimisation) or involving understanding of
a large part of codebase (in which case too it intrigues me). I would love
to work on this one too.
.
If the mentor(s) or anyone else would like to state a few proficiency tests
to prove my aptness for the projects, I would love to submit a couple of
patches before applying for GSoC.
.
Regards,
Abhinav
-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20160303/35de04fc/attachment.html>

John Criswell via llvm-dev

2016-Mar-16 15:56 UTC

head link

[llvm-dev] [GSoC16] Seeking Guidance for a project regarding SAFECode

On 3/3/16 12:25 PM, Abhinav Tripathi via llvm-dev wrote:> Hello,
> I am Abhinav Tripathi, B.Tech 3rd Year student from IIT Indore, India. 
> I was looking on the projects ideas page of llvm and saw that I could 
> also propose to work on the SAFECode Open projects. As I found no 
> mailing list on their site, I am sending this message here. Please 
> redirect me to some other list, if required.
The most useful project for SAFECode right now is to update its code to 
work with either LLVM 3.7 or LLVM 3.8.  I had a student work on this 
last summer (code is at https://github.com/jtcriswell/safecode-llvm37), 
but it needs to be completed and tested.  On my end, I'm interested in 
getting SAFECode dusted off because I'd like to use it for research 
projects that need to attach metadata to memory objects.

Until SAFECode is updated to a newer version of LLVM, its utility is 
pretty limited, and any projects to enhance it will basically require 
that it be updated to a newer version of LLVM.
> .
> I found most of the projects quite alluring as I have been working on 
> a similar project (in terms of tasks) since the last GSoC. It is 
> CPPSharp (https://github.com/genuinelucifer/CppSharp).
> .
> The ones that I found most interesting were:
> 1 - Improve Static Array Bounds Checking -- Because I have done a lot 
> of array related tasks while writing marshalling code for CppSharp. I 
> think I can really contribute into this project.
Static Array Bounds Checking requires that you understand static 
analysis.  Multiple static analysis methods are applicable: range 
analysis, integer linear programming, SMT solvers, etc.  For a 
successful proposal for static array bounds checking, you should know 
which algorithm you will implement and be able to explain why you think 
it will work well.  For SAFECode, the algorithm must be sound with 
respect to two's complement arithmetic (i.e., the algorithm must take 
into account that integers in C can experience underflow or overflow 
when used in arithmetic).
> .
> 2 - Create a simpler CompleteChecks pass -- Although I admit, I didn't 
> quite understand what simpler would mean here. But it seems fairly 
> challenging (if it's regarding optimisation) or involving 
> understanding of a large part of codebase (in which case too it 
> intrigues me). I would love to work on this one too.
The CompleteChecks pass currently uses the DSA points-to analysis (which 
is large and complicated).  There are simpler analyses that one could do 
to determine whether a memory object is read or written by external 
code.  For example, a simple intra-procedural analysis could determine 
if a memory object is allocated and only used by the current function, 
and a simple inter-procedural analysis could create a very simple heap 
abstraction and perform data-flow analysis on the pointers contained 
within heap objects to determine if they are influenced by external 
library code.  Basically, there are some simple quick analyses that 
would be imprecise but could probably find memory objects that are not 
manipulated by external code.
> .
> If the mentor(s) or anyone else would like to state a few proficiency 
> tests to prove my aptness for the projects, I would love to submit a 
> couple of patches before applying for GSoC.
I think the only proficiency test is whether you can show in your 
proposal that you have the necessary programming skills and background 
information to be able to do what you propose.  In both of these 
projects, if you're not familiar with static analysis (e.g., Kam/Ulman 
data-flow analysis), then you're likely not ready for these projects.  
For the two projects you mentioned, I would also expect existing 
familiarity with LLVM.

Again, though, the best project is probably to update SAFECode to a 
modern version of LLVM.

Regards,

John Criswell
> .
> Regards,
> Abhinav
>
>
> _______________________________________________
> LLVM Developers mailing list
> llvm-dev at lists.llvm.org
> http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev

-- 
John Criswell
Assistant Professor
Department of Computer Science, University of Rochester
http://www.cs.rochester.edu/u/criswell

-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20160316/bc27e6c5/attachment.html>

Abhinav Tripathi via llvm-dev

2016-Mar-19 06:55 UTC

head link

[llvm-dev] [GSoC16] Seeking Guidance for a project regarding SAFECode

Sorry, forgot to add the mailing list....

Hi,
Thanks for the detailed response.
.
>
> The most useful project for SAFECode right now is to update its code to
> work with either LLVM 3.7 or LLVM 3.8.  I had a student work on this last
> summer (code is at https://github.com/jtcriswell/safecode-llvm37), but it
> needs to be completed and tested.  On my end, I'm interested in getting
> SAFECode dusted off because I'd like to use it for research projects
that
> need to attach metadata to memory objects.
>
> Until SAFECode is updated to a newer version of LLVM, its utility is
> pretty limited, and any projects to enhance it will basically require that
> it be updated to a newer version of LLVM.
>
I would definitely love to work on this, I do have experience with using
LLVM (atleast till the last version) for the past 1 year. I couldn't
completely compile and get the code (int the link) to work though. But, I
would like to try and work on it's completion.


>
> Static Array Bounds Checking requires that you understand static
> analysis.  Multiple static analysis methods are applicable: range analysis,
> integer linear programming, SMT solvers, etc.  For a successful proposal
> for static array bounds checking, you should know which algorithm you will
> implement and be able to explain why you think it will work well.  For
> SAFECode, the algorithm must be sound with respect to two's complement
> arithmetic (i.e., the algorithm must take into account that integers in C
> can experience underflow or overflow when used in arithmetic).
>

I have experience with Integer linear programming using octave and matlab.
I also developed very basic solvers of my own.  Would it be enough for this
project or should I focus on updating SAFECode to newer version?


The CompleteChecks pass currently uses the DSA points-to analysis (which
is> large and complicated).  There are simpler analyses that one could do to
> determine whether a memory object is read or written by external code.  For
> example, a simple intra-procedural analysis could determine if a memory
> object is allocated and only used by the current function, and a simple
> inter-procedural analysis could create a very simple heap abstraction and
> perform data-flow analysis on the pointers contained within heap objects to
> determine if they are influenced by external library code.  Basically,
> there are some simple quick analyses that would be imprecise but could
> probably find memory objects that are not manipulated by external code.
>
This seems fairly daunting task. Although, I love challenges but since the
last date of sending proposal is very close I don't think I can read and
understand much in this aspect of code analysis to write a decent proposal.


I think the only proficiency test is whether you can show in your
proposal> that you have the necessary programming skills and background information
> to be able to do what you propose.  In both of these projects, if
you're
> not familiar with static analysis (e.g., Kam/Ulman data-flow analysis),
> then you're likely not ready for these projects.  For the two projects
you
> mentioned, I would also expect existing familiarity with LLVM.
>
> Again, though, the best project is probably to update SAFECode to a modern
> version of LLVM.
>

Thanks. I am definitely inclined to updating SAFECode for now. I will
submit the proposal to google within a couple of days. I hope you could
provide some pointers in my proposal before the end date so that I could
improve it.
I have existing familiarity with llvm and clang as I used it for working on
a project to marshal codes from C/C++ to C# and C++/CLI.

Regards,
Abhinav

>
> Regards,
>
> John Criswell
>
On Sat, Mar 19, 2016 at 12:24 PM, Abhinav Tripathi <ee130002001 at
iiti.ac.in>
wrote:
> Hi,
> Thanks for the detailed response.
> .
>
>>
>> The most useful project for SAFECode right now is to update its code to
>> work with either LLVM 3.7 or LLVM 3.8.  I had a student work on this
last
>> summer (code is at https://github.com/jtcriswell/safecode-llvm37), but
>> it needs to be completed and tested.  On my end, I'm interested in
getting
>> SAFECode dusted off because I'd like to use it for research
projects that
>> need to attach metadata to memory objects.
>>
>> Until SAFECode is updated to a newer version of LLVM, its utility is
>> pretty limited, and any projects to enhance it will basically require
that
>> it be updated to a newer version of LLVM.
>>
>
> I would definitely love to work on this, I do have experience with using
> LLVM (atleast till the last version) for the past 1 year. I couldn't
> completely compile and get the code (int the link) to work though. But, I
> would like to try and work on it's completion.
>
>
>
>>
>> Static Array Bounds Checking requires that you understand static
>> analysis.  Multiple static analysis methods are applicable: range
analysis,
>> integer linear programming, SMT solvers, etc.  For a successful
proposal
>> for static array bounds checking, you should know which algorithm you
will
>> implement and be able to explain why you think it will work well.  For
>> SAFECode, the algorithm must be sound with respect to two's
complement
>> arithmetic (i.e., the algorithm must take into account that integers in
C
>> can experience underflow or overflow when used in arithmetic).
>>
>
>
> I have experience with Integer linear programming using octave and matlab.
> I also developed very basic solvers of my own.  Would it be enough for this
> project or should I focus on updating SAFECode to newer version?
>
>
> The CompleteChecks pass currently uses the DSA points-to analysis (which
>> is large and complicated).  There are simpler analyses that one could
do to
>> determine whether a memory object is read or written by external code. 
For
>> example, a simple intra-procedural analysis could determine if a memory
>> object is allocated and only used by the current function, and a simple
>> inter-procedural analysis could create a very simple heap abstraction
and
>> perform data-flow analysis on the pointers contained within heap
objects to
>> determine if they are influenced by external library code.  Basically,
>> there are some simple quick analyses that would be imprecise but could
>> probably find memory objects that are not manipulated by external code.
>>
>
> This seems fairly daunting task. Although, I love challenges but since the
> last date of sending proposal is very close I don't think I can read
and
> understand much in this aspect of code analysis to write a decent proposal.
>
>
> I think the only proficiency test is whether you can show in your proposal
>> that you have the necessary programming skills and background
information
>> to be able to do what you propose.  In both of these projects, if
you're
>> not familiar with static analysis (e.g., Kam/Ulman data-flow analysis),
>> then you're likely not ready for these projects.  For the two
projects you
>> mentioned, I would also expect existing familiarity with LLVM.
>>
>> Again, though, the best project is probably to update SAFECode to a
>> modern version of LLVM.
>>
>
>
> Thanks. I am definitely inclined to updating SAFECode for now. I will
> submit the proposal to google within a couple of days. I hope you could
> provide some pointers in my proposal before the end date so that I could
> improve it.
> I have existing familiarity with llvm and clang as I used it for working
> on a project to marshal codes from C/C++ to C# and C++/CLI.
>
> Regards,
> Abhinav
>
>
>>
>> Regards,
>>
>> John Criswell
>>
>> John Criswell
>> Assistant Professor
>> Department of Computer Science, University of
Rochesterhttp://www.cs.rochester.edu/u/criswell
>>
>>
>-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://lists.llvm.org/pipermail/llvm-dev/attachments/20160319/f50e52cd/attachment-0001.html>

Seemingly Similar Threads

Search for more seemingly similar threads

llvm dev - Mar 2016 - [GSoC16] Seeking Guidance for a project regarding SAFECode

[llvm-dev] [GSoC16] Seeking Guidance for a project regarding SAFECode

[llvm-dev] [GSoC16] Seeking Guidance for a project regarding SAFECode

[llvm-dev] [GSoC16] Seeking Guidance for a project regarding SAFECode

Seemingly Similar Threads