Screenshot capture feature #270

PSNAppz · 2023-01-12T16:07:27Z

Is your feature request related to a problem? Please describe.
Screenshot capture would be a useful addition to an OSINT tool: it lets the tool take screenshots of the pages it crawls and save them to the database or file system. This creates a visual record of the crawled pages, which helps document the results of a crawl. It also builds an archive of those pages that can be used to analyze changes over time.

Describe the solution you'd like
With this feature, the tool could take screenshots at different resolutions and viewport sizes, and even capture the whole webpage, using a library such as Puppeteer or Selenium.
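As a rough illustration of the idea, here is a minimal sketch using Selenium's Python bindings with headless Chrome. The function names, the `screenshots` output directory, and the file naming scheme are all hypothetical and not part of TorBot. The sketch assumes Chrome and a matching driver are available, and it only captures the visible window, not the full page.

```python
import hashlib
from pathlib import Path


def screenshot_path(url: str, width: int, height: int, out_dir: str = "screenshots") -> Path:
    """Build a stable file name from the URL and window size (hypothetical naming scheme)."""
    digest = hashlib.sha256(url.encode("utf-8")).hexdigest()[:16]
    return Path(out_dir) / f"{digest}_{width}x{height}.png"


def capture(url: str, width: int = 1280, height: int = 720, out_dir: str = "screenshots") -> Path:
    """Save a PNG of `url` rendered at the given window size and return its path."""
    # Imported lazily so the path helper above works without Selenium installed.
    from selenium import webdriver
    from selenium.webdriver.chrome.options import Options

    options = Options()
    options.add_argument("--headless=new")  # run Chrome without a visible window
    options.add_argument(f"--window-size={width},{height}")

    target = screenshot_path(url, width, height, out_dir)
    target.parent.mkdir(parents=True, exist_ok=True)

    driver = webdriver.Chrome(options=options)
    try:
        driver.get(url)
        driver.save_screenshot(str(target))  # captures the current window only
    finally:
        driver.quit()  # always release the browser process
    return target
```

Calling `capture("https://www.example.com", 1920, 1080)` would write one PNG per URL and window size. Routing the browser through Tor would need extra proxy configuration that is not shown here.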

Describe alternatives you've considered
N/A

Additional context
It can also be useful for creating a visual comparison of the pages before and after a specific event.
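A very coarse version of such a comparison could be sketched with the standard library alone, assuming two screenshots of the same page have already been saved to disk. The helper names below are hypothetical. Comparing file hashes only shows whether anything changed at the byte level; a real visual diff would need an image library.

```python
import hashlib
from pathlib import Path


def file_digest(path: Path) -> str:
    """Return the SHA-256 hex digest of a file's bytes."""
    return hashlib.sha256(path.read_bytes()).hexdigest()


def screenshot_changed(before: Path, after: Path) -> bool:
    """Return True if the two screenshot files differ at the byte level."""
    return file_digest(before) != file_digest(after)
```

Note that two captures of an unchanged page can still differ byte-for-byte (for example because of rendering differences), so this check is best treated as a first filter.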

KingAkeem · 2023-01-13T21:50:24Z

If anyone wants to take on this task before I do, here's some context.

You should make use of the LinkTree class, which uses treelib to construct a tree data structure that can be printed, downloaded, or used with tree operations such as searching the tree.
https://github.com/DedSecInside/TorBot/blob/dev/torbot/modules/linktree.py

Using the class requires passing the root node of the tree and how far, depth-wise, you would like the tree to be built:

tree = LinkTree(root="https://www.example.com", depth=2)  # builds the tree on instantiation
tree.show()  # prints the tree to standard output
tree.save("test.txt")  # saves the tree results to `test.txt`

The tree nodes currently only save the URL, but treelib has a mechanism to extend nodes to store data.
https://treelib.readthedocs.io/en/latest/index.html#advanced-usage

import requests
from treelib import Tree

class WebMetadata:
    def __init__(self, html, headers):
        self.html = html
        self.headers = headers

# using the treelib library
tree = Tree()
resp = requests.get("https://www.example.com")
# store the page's HTML and headers on the root node
tree.create_node("root", "root", data=WebMetadata(resp.text, resp.headers))
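For the screenshot feature, the same mechanism could carry the path of a saved screenshot alongside the HTML and headers. The sketch below is only an assumption about how that might look: `PageRecord` and `build_tree` are hypothetical names, and it uses treelib directly, not the refactored LinkTree class.

```python
from dataclasses import dataclass
from typing import Mapping, Optional


@dataclass
class PageRecord:
    """Hypothetical node payload: page metadata plus where its screenshot was saved."""
    html: str
    headers: Mapping[str, str]
    screenshot: Optional[str] = None  # file path of the PNG, if one was captured


def build_tree(url: str, record: PageRecord):
    """Create a one-node treelib tree whose root carries `record` as its data."""
    from treelib import Tree  # imported lazily; requires treelib to be installed

    tree = Tree()
    tree.create_node(tag=url, identifier=url, data=record)
    return tree
```

The stored path can then be read back with `tree.get_node(url).data.screenshot`. Storing a path instead of the image bytes keeps the tree small when many pages are crawled.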

pavankalyan767 · 2023-10-15T05:12:58Z

Is this issue closed or open?

PSNAppz · 2023-10-15T06:07:51Z

@pavankalyan224847 This is open and not assigned to anyone.

KingAkeem · 2023-10-15T12:31:11Z

The earlier LinkTree comment on this issue (#270) is out of date.

The LinkTree class still exists but has been completely refactored. You can check out the refactored code here:

https://github.com/DedSecInside/TorBot/blob/dev/torbot/modules/linktree.py

Let me know if you have any questions.

pavankalyan767 · 2023-10-15T12:40:34Z

Can you assign this to me? I would like to work on it.

KingAkeem · 2023-10-15T13:16:47Z

@pavankalyan224847 Done!

KingAkeem · 2023-10-27T14:32:28Z

Updates?

PSNAppz added Enhancement Idea Good First Issue Hacktoberfest HackToberFest 2023 labels Jan 12, 2023

PSNAppz mentioned this issue Oct 7, 2023

Move log level from environment variable to CLI flag #292

Closed

KingAkeem assigned pavankalyan767 Oct 15, 2023

KingAkeem added this to TorBot v4.1.0 Oct 17, 2023

KingAkeem added the Ongoing label Oct 18, 2023
